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SYNTHETIC POLYNUCLEOTIDES 

TECHNICAL FIELD 

The present invention relates to the production of 
recombinant tropoelastins , and variants of these 
5 recombinant tropoelastins, from synthetic 
polynucleotides, and uses of the tropoelastins and 
variants . 

BACKGROUND ART 

There are various forms of tropoelastin that 

10 typically appear to consist of two types of alternating 
domains: those rich in hydrophobic amino acids 
(responsible for the elastic properties) and those rich 
in lysine residues (responsible for cross-link 
formation) . Hydrophobic and cross-linking domains are 

15 encoded in separate exons (Indik et al . , 1987). 

The gene for tropoelastin is believed to be present 
as a single copy in the mammalian genome, and is 
expressed in the form of multiple transcripts, 
distinguished by alternative splicing of the pre-mRNA 

20 (Indik et al, 1990; Oliver et al, 1987) . 

Previous recombinant work with tropoelastin has been 
reported by Indik et al (1990) who achieved modest 
expression of a natural human tropoelastin sequence from 
cDNA. Their product was unstable, the free polypeptide 

25 being rapidly degraded. 

Bressan et al (1987) have reported the cloning of a 
defined naturally occurring segment of chick 
tropoelastin. 

DESCRIPTION OP THE INVENTION 

3 0 The present invention provides for the expression of 

significant amounts of tropoelastins or variants of the 
tropoelastins in recombinant expression systems. 

The present inventors have recognised that 
tropoelastins are proteins which can be used in a variety 

35 of, for instance, pharmaceutical applications, but these 
uses require significant quantities of tropoelastin. 
These quantities could be obtained by cloning naturally 
occurring tropoelastin genes, but the present inventors 
show how they can be more easily obtained by producing 



synthetic polynucleotides adapted to provide enhanced 
expression . 

The present inventors have recognised that because 
tropoelastins have highly repetitive coding sequences, 
5 the tropoelastin genes have the potential to include 
significant numbers of codons which have low usage in 
particular hosts. Codons of low usage can hamper gene 
expression . 

For example, in one tropoelastin coding sequence 

10 described in detail in this application, the natural 
sequence contains of the order of 80 glycine GGA codons 
which comprises 10% of the gene and have low usage in 
Escherichia coli [Fazio et al . , 1988, and Genetics 
Computer Group (GCG) package version 7 -UNIX using Codon 

15 Frequency and Gen Run Data: ecohigh-cod] . 

According to a first aspect of the present 
invention, there is provided a synthetic polynucleotide 
encoding the amino acid sequence of a tropoelastin or a 
variant of the tropoelastin. 

20 The tropoelastin may be a mammalian or avian 

tropoelastin such as human, bovine, ovine, porcine, rat 
or chick tropoelastin. Preferably, the tropoelastin is 
human tropoelastin. 

The synthetic polynucleotide sequence is altered 

25 with respect to the natural coding sequence for the 
tropoelastin molecule or variant so that: 

a) it codes for a tropoelastin sequence or a variant of 
the. tropoelastixL;-. and. 

b) all or some of the codons which hamper expression in 
3 0 the expression system in which the polynucleotide is 

to be expressed, are replaced with codons more 

favourable for expression in the expression system. 

Preferably all, or part, of the 5' or 3' 
untranslated regions, or both, of the natural coding 
35 sequence are excluded from the synthetic polynucleotide. 

Preferably all, or part, of the signal peptide 
encoding region is excluded from the synthetic 
polynucleotide . 



Where the synthetic polynucleotide is prepared from 
assembled oligonucleotides it is preferred to incorporate 
restriction sites in the sequence to facilitate assembly 
of the polynucleotide . 
5 Restriction sites incorporated in the polynucleotide 

sequence are also useful for: 

1. facilitating subcloning of manageable blocks for 
sequence confirmation; 

2. providing sites for later introduction of 
10 modifications to the polynucleotide as insertions, 

deletions or base changes; 

3. facilitating confirmation of correct poly- 
nucleotide assembly by restriction endonuclease 
digestion . 

15 A preferred expression system is an Escherichia coli 

expression system. However, the invention includes 
within its scope synthetic polynucleotides suitable for 
use in other expression systems such as other microbial 
expression systems . These other expression systems 

2 0 include yeast and bacterial expression systems, insect 

cell expression systems, and expression systems involving 
other eukaryotic cell lines or whole organisms . 

Modifications to codon usage to provide enhanced 
expression are discussed in: 
25 Zhang et a2 (1991) for E. coli, yeast, fruit fly and 

primates where codon usage tables are provided; 

Newgard et al (1986) for mammals; and Murray et al 
(1989) for plants. Preferred codon. usages. ..are . indicated, 
in these publications . 

3 0 Preferably, at least 50% of codons for any 

particular amino acid are selected and altered to reflect 
preferred codon usage in the host of choice. 

Preferably, the polynucleotide is a fused 
polynucleotide with the tropoelastin or variant encoding 
3 5 sequence fused to a polynucleotide sequence compatible 
with the host. The compatible sequence is preferably at 
the 5' end of the polynucleotide molecule. 

Preferred compatible polynucleotides include those 



which encode all or part of a polypeptide which causes 
the expressed fusion to be secreted or expressed as a 
cell surface protein so as to facilitate purification of 
the expressed product, or expressed as a cytoplasmic 
5 protein. 

One preferred compatible polynucleotide is one 
encoding all or part of glutathione-S-transf erase . 

In addition the synthetic polynucleotides can encode 
additional residues such as an N-terminal methionine or 
10 f -methionine not present in the natural counterpart. 

A preferred synthetic polynucleotide is one 
comprising the sequence illustrated in Figure 3 (1) to 3 
(5) (SEQ ID NO 1) or a part of it, encoding a polypeptide 
which retains elastic properties. The sequence 

15 illustrated in Figure 3 (1) to 3 (5) is 2210 bp in size. 

To our knowledge, this is the largest synthetic gene 
constructed so far. Previously, the largest was of the 
order of 1.5 kb in size. 

The actual changes made in this sequence in 

2 0 comparison with the natural sequence from which it was 

derived are shown in Figure 6 (1) to 6 (4) comparing the 
synthetic sequence (SEQ ID NO 1) with the natural 
sequence (SEQ ID NO 53) . Synthetic polynucleotides in 
which only some of the base changes shown in that Figure 
25 have been made are also within the scope of the 
invention. 

It is known that tropoelastin genes in nature are 
expr.ess,ed. as., multiple, txan scripts, which are distinguished 
by alternative splicing of the pre-mRNA as described in, 

3 0 for instance: 

Indik et al , 1990; Oliver et al, 1987; Heim et al, 
1991; Raju et al , 1987; and Yeh et al , 1987. The 
tropoelastins of the present invention for which 
synthetic polynucleotides are prepared are intended to 
35 encompass these different splice forms. 

Variants of tropoelastins embodying the present 
invention are polypeptides which retain the basic 
structural attributes, namely the elastic properties, of 
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a tropoelastin molecule, and which are homologous to 
naturally occurring tropoelastin molecules. For the 
purposes of this description, "homology" between two 
sequences connotes a likeness short of identity 
5 indicative of a derivation of one sequence from the 
other. In particular, a polypeptide is homologous to a 
tropoelastin molecule if a comparison of amino-acid 
sequences between the molecules reveals an identity of 
greater than about 65% over any contiguous 20 amino acid 

10 stretch or over any repetitive element of the 
tropoelastin molecule shorter than 20 amino acids in 
length. Such a sequence comparison can be performed via 
known algorithms, such as the one described by Lipman and 
Pearson, Science 227 : 1435 (1985) which are readily 

15 implemented by computer. 

Variants of tropoelastins can be produced by 
conventional site-directed or random mutagenesis. This 
is one avenue for routinely identifying residues of the 
molecule that can be modified without destroying . the 

20 elastic properties of the molecule. 

Oligonucleotide-directed mutagenesis, comprising: 

1. synthesis of an oligonucleotide with a sequence that 
contains the desired nucleotide substitution (mutation) , 

2 . hybridizing the oligonucleotide to a template 
25 comprising a structural sequence coding for tropoelastin 

and 

3. using a DNA polymerase to extend the oligonucleotide 
as a. primer,, is- preferred- because of its- ready utility in 
determining the effects of particular changes to the 

30 structural sequence. Its relative expense may militate 
in favour of an alternative, known direct or random 
mutagenesis method. 

Another approach which is particularly suited to 
situations where the synthetic polynucleotide has been 
35 prepared from oligonucleotide blocks bounded by 
restrictions sites is cassette mutagenesis where entire 
restriction fragments are inserted, deleted or replaced. 

Also exemplary of variants within the present: 
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invention are molecules that correspond to a portion of a 
tropoelastin molecule without being coincident with a 
natural tropoelastin molecule and which retain the 
elastic properties of a natural tropoelastin molecule. 
5 Other variants of tropoelastins of the present 

invention are fragments that retain the elastic 
properties of a tropoelastin molecule. 

Fragments within the scope of this invention are 
typically greater than 20 amino acids in length. 

10 According to a second aspect of the present 

invention there is provided a recombinant DNA molecule 
comprising a synthetic polynucleotide of the first 
aspect, and vector DNA. 

Vectors useful in the invention include plasmids, 

15 phages and phagemids . The synthetic polynucleotides of 
the present invention can also be used in integrative 
expression systems or lytic or comparable expression 
systems. 

Suitable vectors will generally contain origins of 

2 0 replication and control sequences which are derived from 
species compatible with the intended expression host. 
Typically these vectors include a promoter located 
upstream from the synthetic polynucleotide, together with 
a ribosome binding site for prokaryotic expression, and a 

25 phenotypic selection gene such as one conferring 
antibiotic resistance or supplying an auxotrophic 
requirement. For production vectors, vectors which 
provide, for. enhanced . stability, -through partitioning may- 
be chosen. Where integrative vectors are used it is not 

30 necessary for the vector to have an origin of 
replication. Lytic and other comparable expression 
systems do not need to have those functions required for 
maintenance of vectors in hosts. 

Typical vectors include pBR322, pBluescript II SK + , 

35 pGEX-2T, pTrc99A, pET series vectors, particularly pET3d, 
(Studier et al ; 1990) and derivatives of these vectors. 

According to a third aspect of the present invention 
there is provided a transformed host transformed with a 
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recombinant DNA molecule of the second aspect . 

Hosts embodying the invention include bacteria, 
yeasts, insect cells and other eukaryotic cells or whole 
organisms. They are typically bacterial hosts. 
5 A preferred host is an E. coli strain. Examples of 

E. coli hosts include E. coli B strain derivatives 
(Studier et al, 1990), NM522 (Gough and Murray, 1983) and 
XLl-Blue (Bullock et al, 1987) . Hosts embodying this 
invention, for providing enhanced expression of 
10 tropoelastin or tropoelastin variants, are those in which 
the altered codon usage is favourable for expression, and 
with which any control sequences present in the 
recombinant DNA are compatible. 

According to a fourth aspect of the present 
15 invention there is provided an expression product of a 
transformed host of the third aspect which expression 
product comprises a tropoelastin or a variant thereof. 

A preferred expression product of the fourth aspect 
comprises all or part of the amino- acid sequence depicted 
20 in Figure 3 (1) to 3 (5) (SEQ ID NO: 1) . The serine at 
position 1 may be deleted from the product and similarly 
the methionine at position 2 may be deleted. 

Other preferred expression products are those in 
which only some of the base changes shown in Figure 6 (1) 
25 to 6 (4) have been made. Typically at least 50% of the 
indicated base changes have been made. 

The expression products of the fourth aspect may be 
fused, expression, products, ..which... include all or part of a 
protein encoded by the vector in peptide linkage with the 
30 expression product. They may also include, for example, 
an N- terminal methionine or other additional residues 
which do not impair the elastic properties of the 
product . 

Typically the fusion is to the N-terminus of the 
35 expression product. An example of a suitable protein is 
glutathione-S-transf erase . The fused protein sequence 
may be chosen in order to cause the expression product to 
be secreted or expressed as a cell surface protein tc 



simplify purification or expressed as a cytoplasmic 
protein . 

The expressed fusion products may subsequently be 
treated to remove the fused protein sequences to provide 
free tropoelastin or a free tropoelastin variant . 

The expression products of the fourth aspect may 
also be produced from non-fusion vectors such as pND211 
(N. Dixon, Australian National University) . This vector 
has the gene inserted into an Ncol site and uses lambda- 
promoter- driven expression to permit initiation from the 
start codon of the synthetic gene. The sequence of the 
vector is shown at Figure 9 (1) and 9 (2) (SEQ ID NO: 
54) . Other suitable non- fusion vectors include pET3d. 

According to a fifth aspect of the present invention 
there is provided a pharmaceutical or veterinary 
composition comprising an expression product of the 
fourth aspect together with a pharmaceutically or 
veterinarally acceptable carrier. 

Dosage of the expression product and choice of 
carrier will vary with the specific purpose for which the 
expression product is being administered. 

The expression products of the fourth aspect may 
also be prepared in the form of foods or as industrial 
products where elastic or association properties may be 
desired. The tropoelastin expression products of the 
invention can form associations in solution wherein the 
tropoelastin molecules are held together by hydrophobic 
interactions. These associations are termed - 

"coacervates" . They are useful as precursors to elastin 
synthesis. The tropoelastin coacervates can also be used 
as delivery vehicles for active ingredients such as 
pharmaceutical or veterinary agents providing 
biodegradable or biodissociable slow release formulations 
or alternatively protective coatings to protect active 
agents, for instance, during their transit through the 
stomach of a host . 

According to a sixth aspect of the present invention 
there is provided a process for the production of an 



expression product of the fourth aspect comprising: 

providing a transformed host of the third aspect; 
culturing it under conditions suitable for the expression 
of the product of the fourth aspect; and collecting the 
expression product . 

In one preferred form the expression product is 
produced in the form of inclusion bodies which are 
harvested from the transformed host . 

In a seventh aspect of the invention there is 
provided a cross-linked expression product of the fourth 
aspect. The cross -linked expression products form 

elastin or elastin-like products. 

In preparing a synthetic polynucleotide in 
accordance with the first aspect the following procedure 
is followed. 

A cDNA sequence encoding a tropoelastin, or a part 
of it, is selected and the open reading frame is defined. 

The sequence is then translated to provide the 
corresponding amino acid sequence. Alternatively, the 
procedure can commence from a known amino acid sequence . 

The exons which are to be included in the expression 
product are chosen. Preferably, any signal sequence or 
untranslated regions will not be included in the 
synthetic polynucleotide. 

The amino acid sequence selected is then converted 
to a polynucleotide sequence on the basis of codon usage 
frequencies. By selecting the most commonly used codon 
for each amino acid for the host in which expression is. 
desired, a skewed usage arises because particular codons 
may have very different frequencies of usage. It is 
therefore necessary to adjust the codon usage of at least 
the most common codons, that is, those present at greater 
than 20 occurrences, to more closely match levels of 
codon usage in the host of choice. 

It is preferable to alter the sequence to introduce 
restriction sites at regular intervals throughout the 
sequence where these represent silent alterations, that 
is, they do not change the resulting amino acid. In 
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addition ends suitable for ligation, eg BamHI and/or Ncol 
sites can be introduced into the sequence. 

Tropoelastin sequences described for various 
organisms are similar, particularly at the level of exon 
5 structure and the organisation of hydrophilic and 
hydrophobic domains. In selecting exons to be included 
in the expression product we have adopted an approach 
whereby we leave in exons known to occur in all available 
tropoelastins . Depending on the intended use of the 

10 resulting tropoelastin, additional exons, or synthetic 
sequences, or both, are included. For instance, in the 
human example provided we included exon 10A which only 
occurs in some of the known sequences for human 
tropoelastin. In the bovine case, a typical addition 

15 would be exons 4A, 6 and/or 9 (Raju and Anwar, 1987; Yeh 
et al, 1987) . In the rat case, a typical addition would 
be exons corresponding to exons 12 through 15 of the 
bovine case. (Heim et al 1991). 

The construction of the synthetic polynucleotide of 

2 0 Figures 3 and 6 will now be described in more detail. 

The synthetic tropoelastin gene described here 
differs from the natural coding sequence (s) in a number 
of ways. The untranslated regions present in the 
tropoelastin cDNA sequence were disregarded in designing 

2 5 the synthetic gene, and the nucleotides encoding the 
signal peptide were removed. Restriction endonuclease 
recognition sites were incorporated at regular intervals 
into the gene by typically altering, only the third hase_ 
of the relevant codons, thereby maintaining the primary 

30 sequence of the gene product. The facility for silent 
alteration of the coding sequence was also exploited to 
change the codon bias of the tropoelastin gene to that 
commonly found in highly expressed E.coli genes. 
[Genetics Computer Group (GCG) package version 7 -UNIX 

35 using Codon Frequency and Gen Run Data: ecohigh-cod] . 
Two additional stop codons were added to the 3' -end, and 
an ATG start codon comprising a novel Ncol site was 
appended to the 5 '-end. Bam HI cloning sites were 



engineered at both ends of the synthetic sequence. Since 
the gene contains no internal methionine residues, 
treatment of the newly-synthesized gene product 
(expressed directly or as a fusion with another gene) 
with cyanogen bromide would liberate a protein with the 
same or similar sequence as one form of natural 
tropoelastin comprising 731 amino acids. Other forms of 
processing are envisaged, which may generate tropoelastin 
species of the same or different lengths. 

Two stop codons were added in order to allow the 
possible use of the construct in suppressor hosts, and 
also to avoid any potential depletion of termination 
(release) factors for translation. 

The inclusion of an ATG site is useful because: (1) 
it provides an appropriate restriction site for cloning, 
although this is a flexible property; (2) it provides a 
potential start codon for translation of an unfused 
synthetic gene; and (3) it introduces a methionine which 
can be cleaved by cyanogen bromide to release the 
tropoelastin species. However, another method of 

cleavage would not necessarily rely upon the availability 
of this methionine. 

Fusion can provide a more stably expressed protein, 
and experience of other workers has suggested that 
unfused tropoelastin may be unstable (Indik et al . , 
1990) . The fusion is typically to the carboxy terminus 
of the fusion protein (i.e. the N-terminus of the 
tropoelastin) . Glutathione-S-transf erase (Smith. and 
Johnson, 1988) is an example of a suitable fusion 
protein. 

A convergent approach was used in assembly and 
cloning of the synthetic human tropoelastin (SHEL) 
sequence. Groups of six, and in one case, eight, 
oligonucleotides were annealed and ligated together to 
generate eight synthetic blocks of approximately 260- 
300bp, designated SHEL1-8. These blocks were cloned 
independently into pBluescript II SK + ; the assembly and 
cloning scheme for SHEL1 is illustrated in Figure 1. 



- 12' - 

Following sequence confirmation, the blocks were excised 
from their parent plasmids and used to construct three 
clones, pSHEL a,& and y, each containing approximately 
700-800bp of the synthetic gene. The final step towards 
5 assembly of the complete SHEL gene involved ligation of 
the inserts from each of these three intermediary clones 
into pBluescript II SK + to produce pSHEL. The cloning 
scheme is illustrated in Figure 2 . 

The tropoelastin or variant produced as an 
10 expression product from vectors such as pSHEL can be 
chemically cross-linked to form an elastin product. 
Three available procedures are : 

1. chemical oxidation of lysine side chains which are 
conducive to cross -linking [eg ruthenium tetroxide- 

15 mediated oxidation, via the amide (Yoshifuji S; Tanaka K; 
and Nitto Y (1987) Chem. Pharm Bull 35 2994-3000) and 
quinone -mediated oxidation] ; 

2. homobi functional chemical cross-linking agents, such 
as dithiobis ( succinimidylpropionate ) , dimethyl 

20 adipimidate and dimethyl pimelimidate . There are many 
other amine-reactive cross-linking agents which could be 
used as alternatives; and 

3. cross-linking via lysine and glutamic acid side 
chains as taught by Rapaka et al (1983) . 

25 The tropoelastins or variants of the invention may 

also be enzymatically cross-linked to form an elastin or 
elastin-like product. Enzymatic methods include lysyl 
oxidase-mediated oxidation of the tropoelastin or variant 
via modification of peptidyl lysine [Beddell-Hogan et al 

30 (1993)]. Oxidised lysines participate in the generation 
of cross-linkages between and within tropoelastin 
molecules. Other modification enzymes can be used 
forming cross-links via lysine or other residues. 

Cross -linking can also be achieved by gamma 

35 irradiation using, for instance, techniques adapted from 
Urry et al (1986) . 

Tropoelastins or variants of the invention cross- 
linked to form elastin or elastin-like products are also 



within the scope of the invention. 

The half-lives of the products in free solution will 
determine the suitability of a particular agent for a 
particular application. 

For example, the hydrolytic breakdown of the cross - 
linked material will be useful in applications, such as 
surgical applications, where the gradual loss of material 
over time is intended. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention is further described with 
reference to the accompanying drawings in which: 

Figure 1 shows the scheme for construction and 
cloning of SHEL1, one of the eight intermediary 
subassemblies used to generate the SHEL sequence. A 
similar approach was adopted for each of the remaining 
blocks (sHEL 2-8) . See materials and methods section for 
details. 5 ' -phosphorylated oligonucleotides are indicated 
with a black dot (•) . 

Figure 2 shows the cloning scheme for the synthetic 
human tropoelastin (SHEL). - Abbreviations: B,Bam HI; 
U.Hindlll; K,KpnI; N.NotI; P,PstI; S,Sad; Sp,SpeI. 

Figure 3 (1) to 3 (5) shows over 5 drawing sheets 
the full nucleotide sequence (SEQ ID NO: 1) and 
corresponding amino acid sequence (SEQ ID NO: 2) for the 
synthetic human tropoelastin (SHEL) . Coding (+) strand 
of the sHEL gene construct is shown on the upper 
(numbered) sequence line. Synthetic complementary (-) 
strand sequence is shown immediately beneath it. The 
amino acid sequence of the synthetic gene product is 
indicated below the nucleotide sequence. 

Figure 4 (1) to 4 (2) shows over 2 drawings sheets 
the sequences for the oligonucleotides (SEQ ID NOS: 3 to 
27) used to construct the synthetic human tropoelastin 
(SHEL) sequence: ( + ) - strand oligonucleotides. 

Figure 5 (l) to 5 (2) shows over 2 drawing sheets 
the sequences for the oligonucleotides (SEQ ID NOS: 28 to 
52) used to construct the synthetic human tropoelastin 
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(SHEL) sequence: (-) - strand oligonucleotides. 

Figure 6 (1) to 6 (4) shows over 4 drawing sheets 
the differences in nucleotide sequence between SHEL (SEQ 
ID NO: l) and a cDNA form of the coding region of the 
human tropoelastin gene (SEQ ID NO: 53). The coding (+)- 
strand of the synthetic (SHEL) sequence is shown on the 
top (numbered line) . The cDNA sequence is indicated 
below it, showing only those nucleotides which differ 
from the synthetic sequence. 

Figure 7 shows the results of SDS-PAGE analysis of 
tropoelastin fusion protein expression from pSHELC. Lane 
1: standards; Lane 2: non-induced; Lane 3: induced. The 
arrow points to the overexpressed fusion protein. 

Figure 8 shows the correlation between predicted and 
observed amino acid content for the fusion protein 
expressed from pSHELC: —A- Net data (%) 

--0-- Expected (%) 

Figure 9 (l) to 9 (2) over 2 drawing sheets shows 
the sequence (SEQ ID NO: 54) of the plasmid vector 
pND211 . 

Figure 10 shows the results of SDS-PAGE analysis of 
tropoelastin expression from pSHELF. 

Lane 1: standards; Lane 2: induced; Lane 3: 
uninduced: Lane 4: alcohol -purified sample; Lane 5: 
additional lane of alcohol purified sample. 

Figure 11 shows the correlation between predicted 
and observed amino acid content for tropoelastin... 
expressed from pSHELF. 

BEST METHOD OF PERFORMING THE INVENTION 

The recombinant and synthetic techniques used are 
standard techniques which are described in standard texts 
such as Sambrook et al (1989) . 

Purification of the expression products is also 
performed using standard techniques, with the actual 
sequence of steps in each instance being governed by the 
host/expression product combination. 
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The pharmaceutical and veterinary compositions are 
formulated in accordance with standard techniques . 

The amount of expression product that may be 
combined with carrier to produce a single dosage form 
will vary depending upon the condition being treated, the 
host to be treated and the particular mode of 
administration . 

It will be understood, also, that the specific dose 
level for any particular host will depend upon a variety 
of factors including the activity of the expression 
product employed, the age, body weight, general health, 
sex, diet of the patient, time of administration, route 
of administration, rate of excretion, drug combination, 
etc . 

The compositions may be administered parenterally in 
dosage unit formulations containing conventional, non- 
toxic, pharmaceutically and/or veterinarally acceptable 
carriers, diluents, adjuvants and/or excipients as 
desired. 

Injectable preparations, for example, sterile 
injectable aqueous or oleagenous suspensions may be 
formulated according to the known art using suitable 
dispersing or wetting agents and suspending agents. The 
sterile injectable preparation may also be a sterile 
injectable solution or suspension in a ' non- toxic 
parenterally acceptable diluent or solvent. Among the 
acceptable vehicles or solvents that may be employed are 
water, Ringer's solution, and isotonic, sodium . ...chloride - 
solution. In addition, sterile, fixed oils are 

conventionally employed as a solvent or suspending 
medium. For this purpose any bland fixed oil may be 
employed including synthetic mono- or diglycerides . In 
addition, fatty acids such as oleic acid and organic 
solvents find use in the preparation of injectables. 

Routes of administration, dosages to be administered 
as well as frequency of administration are all factors 
which can be optimised using ordinary skill in the art. 

In addition, the expression products may be prepared 
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as topical preparations for instance as ant i -wrinkle and 
hand lotions using standard techniques for the 
preparation of such formulations. They may be prepared 
in aerosol form for, for instance, administration to a 
5 patient's lungs, or in the form of surgical implants, 
foods or industrial products by standard techniques. 

The tropoelastins can be cross-linked either 
chemically, enzymatically or by irradiation to form 
elastin products for use in applications such as 

10 pharmaceutical applications, surgical, veterinary and 
medical applications, cosmetic applications, and in 
industrial uses. Tropoelastin coacervates can be used to 
formulate slow release compositions of active ingredients 
or to form protective coatings for active ingredients 

15 using standard formulation techniques . 
Materials and Methods 
Materials 

Restriction enzymes, T4 polynucleotide kinase and T4 
DNA ligase were obtained from Boehringer Mannheim, Progen 

2 0 Industries or New England Biolabs . Gelase was obtained 
from Epicentre Technologies. Reagents for solid-phase 
oligodeoxynucleotide synthesis were obtained from Applied 
Biosystems (ABI) . Low melting temperature (LMT) agarose 
was obtained from Progen or FMC and a- 35 S-dATP was 

25 obtained from Amersham International. Plasmid vectors 
pBluescript II SK + and pGEX-2T were obtained from 
Stratagene and Medos Co Pty Ltd respectively. pET3d was 
obtained from F.W. Studier at Brookhaven National- 
Laboratory, NY, U.S.A. EL coli strains HMS174 and BL21 

30 (DE3) are described in Studier et al (1990). 

Oligodeoxynucleotide Synthesis and Purification 

Oligonucleotides were synthesized on 40nmol-scale 
polystyrene -support columns on an Applied Biosystems 381A 
or 394 DNA synthesis machine. Standard ABI protocols 

35 were employed for synthesis, including chemical 5'- 
phosphorylation where appropriate. Detritylation was 
performed automatically, and cleavage from the solid 
support effected manually (381A) or automatically (394) 
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according to the synthesizer used. Base protecting 
groups were removed by heating the aramoniacal 
oligonucleotide solution at 55-60°C overnight. 
Deprotected oligonucleotides were lyophilized, dissolved 
in 400/il TE buffer and ethanol precipitated prior to 
resuspension in 100/il 50% deionized formamide in TE. 

All oligonucleotides used in construction of the 
sHEL gene were purified by denaturing PAGE before use. 
160mm x 100mm x 1.5mm polyacryl amide gels containing 7M 
urea were used for this purpose. Short oligonucleotides 
(<40-mers) were purified on 20% gels whilst long 
oligonucleotides (>85-mers) were purified on gels 
containing 8-10% acrylamide (acrylamide : bisacrylamide 
19:1). Samples were heated to 75°C for 3 min before 
loading. Tracking dye (0.05% bromophenol blue, 0.05% 
xylene cyanole FF in deionized formamide) was loaded into 
an adjacent lane. Electrophoresis was conducted at 
constant power (17W) until the bromophenol blue marker 
was within 1cm of the base of the gel. The apparatus was 
disassembled and the gel wrapped in cling film. Product 
bands were visualized by UV-shadowing over a fluorescent 
TLC plate. Excised gel fragments containing purified 
oligonucleotides were transferred to microcentrifuge 
tubes, crushed and soaked overnight at 60°C in 500^1 
elution buffer (0.3M sodium acetate pH7.0). A second 
extraction was performed with 400/il elution buffer, for 
3-4h at 60 °C and the supernatant combined with that of 
the first extraction. The total volume of... the... 

oligonucleotide-containing solution was reduced to 
approximately 400/il by butan-l-ol extraction and DNA 
precipitated by addition of 1ml ethanol. Purified 
oligonucleotide was pelleted by centrif ligation, 
redissolved in 20/il TE buffer and quantified by 
spectrophotometry. The final yield of purified 

oligonucleotide obtained in this manner was typically 10- 
30/xg. 

Construction of Synthetic Gene 'Blocks* (sHELl-8) 

Complementary oligonucleotides (3 0pmol each, approx 
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l//g for 95-mers) were annealed in 10/il buffer containing 
50mM Tris.HCl pH7 . 5 , lOmM MgCl 2 . The mixture was 

overlayed with 12/zl paraffin oil, heated to 95°C and 
cooled slowly to 16°C (16h) in a microprocessor- 
5 controlled heating block (Perkin Elmer Cetus Thermal 
Cycler) . Annealed samples were transferred to clean 
microcentrifuge tubes and a small aliquot withdrawn 
for analysis by agarose gel electrophoresis (2%LMT gel, 
TBE running buffer) . For each block comprising three 

10 complementary oligonucleotide pairs, four separate 
ligation reactions were set up. Each contained 50mM 
Tris.HCl pH7.5, lOmM MgCl 2 , lmM ATP, 3mM DTT, 3^1 each of 
the appropriate annealed samples, 0.5/il (0.5U) T4 DNA 
ligase and Milli-Q water to a total volume of 10/xl . All 

15 components except the ATP, DTT and T4 ligase were mixed 
and heated to 55°C for 5 min to denature cohesive termini 
and cooled to room temperature before addition of the 
remaining components. Ligation reactions were incubated 
overnight at 16°C and analysed on 2% LMT agarose gels, 

20 with TBE as running buffer. Ligated blocks were purified 
by preparative agarose gel electrophoresis using 2% LMT 
agarose gels with TAE running buffer. Product bands were 
identified under long-wave UV illumination with reference 
to known DNA size standards (pBluescript II SK + digested 

25 with ■ Hae III) and excised in the minimum possible volume 
of gel . DNA was recovered from LMT agarose fragments 
using Gelase® in accordance with the manufacturer's 
instructions,., ("fast," ,. protocol ) ... Purity., and yield o'f 
recovered sHEL blocks was assessed by analytical agarose 

30 electrophoresis alongside known DNA size standards. 
Block 8 was created by a slightly different strategy. 
The first 3 oligonucleotide pairs (numbers 22, 23, 24, 
47, 48 and 49) were assembled and purified as described 
for blocks 1 to 7, after which the remaining 

35 oligonucleotide pair (numbers 25 and 50) was ligated 
under conditions described above. The full length block 
8 was purified as described for blocks 1 to 7 . 



The oligonucleotides used for preparing each of the 
blocks shown in Figures 4 (1) to 4 (2) and 5 (l) to 5 (2) 



were 


assembled as 


follows : 










Block 


+ strand 


Seg 


ID 


-strand 


Seq 


ID 




oligonucleotides 




oligonucleotides 






1 


1,2,3 


3 - 


5 


C O '"7 O Q 

Zb , Z 1 , Z o 






2 


4,5,6 


6 - 


8 


2 9,30,31 








7,8,9 


9 - 










4 


10, 11, 12 


12 - 










5 


13, 14, 15 


15 - 


17 


38,39,40 


40 - 


■ 42 


6 


16, 17, 18 


18 - 


20 


41,42,43 


43 - 


• 45 


7 


19,20,21 


21 - 


23 


44,45,46 


46 - 


• 48 


8 


22,23,24,25 


24 - 


27 


47,48,49, 50 


49 - 


52 



Blocks 1-8: Cloning 

15 pBluescript II SK + DNA was digested with appropriate 

restriction enzymes and purified at each stage by 
preparative gel electrophoresis (1% agarose, TAE buffer) . 
Plasmid DNA was isolated from agarose using a proprietary 
DNA purification matrix (Prep-A-Gene , Bio-Rad) . 

20 Approximately lOOng (ca. 0.05pmol) of purified plasmid 
fragment was added to 50ng (ca. 0.3pmol) synthetic block 
in 17/zl buffer containing 50mM Tris.HCl pH7.5, lOmM MgCl 2 
and the mixture heated at 55 °C for 5 min to denature 
cohesive termini. Upon cooling to room temperature, 2^1 

25 lOmM ATP, 3 0mM DTT and Ifil T4 DNA ligase (1U) were added 
and the reaction incubated overnight at 16°C. TE buffer 
was added to a final volume of 50/xl and DNA precipitated 
with 150/il ethanol . Pelleted DNA was dissolved in 10/il 
TE and 1/xl of the solution used to transform E. coli XL1- 

30 Blue (Bullock et al, 1987) by electroporation . 
Transf ormants were selected on LB plates containing 
ampicillin (50/zgml -1 ) , IPTG (O.lmM) and X-gal (80/igml" 1 ) . 
Clones were screened following DNA extraction by 
restriction mapping and DNA sequence analysis. 



35 The restriction enzymes used to digest pBluescript 

II SK + for the cloning of each of these blocks were as 
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follows : 



Block pBluescript II SK + digested with: 

1 Kpnl, BajnHI 

2 Kpnl, Hindi 1 1 
5 3 Hindi 1 1, No tl 

4 NotI, Sacl 

5 Spel , Sacl 
G Kpnl , Spel 
7 Kpnl, Ps tl 

10 8 BamS.1, PstI 



Construction of pSHELa,S and 7 

Two (pSHEL?) or three (pSHELa,J5) blocks were ligated 
into pBluescript II SK + in a single reaction. Each block 
was excised from the appropriate pBluescript II SK + 

15 -derived plasmid and purified by preparative agarose gel 
electrophoresis. 25ng (ca. 0.15pmol) of each synthetic 
block (eg. blocks 1-3 in the case of pSHELa) and 150ng 
(ca. 0.075pmol) of the appropriate pBluescript II SK + 
fragment were ligated in a total reaction volume of 2 0/zl 

20 under conditions similar to those used to assemble the 
individual blocks. Transf ormants were screened by 

restriction analysis. The digestion schemes are 

illustrated in Figure 2. 

Final Assembly of the SHEL gene 

25 The three gene subassemblies- pSHELaf> £- and 7 were- 

excised from their parent plasmids by treatment with the 
appropriate restriction enzymes (see cloning scheme) and 
purified by agarose gel electrophoresis. lOOng of 
pBluescript II SK + DNA linearised with BairiRl and treated 

30 with calf alkaline phosphatase. This and 50ng (ca. 
O.lOpmol) of each subassembly were ligated at 16 °C for 1 
hour using the DNA Ligation Kit (Amersham International 
pic) according to the supplied protocol. Transf ormants 
were selected on LB-ampicillin plates containing IPTG and 

35 X-gal, and analysed by restriction mapping. The two 
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orientations of the SHEL gene in pBluescript were 
designated pSHELA and pSHELB. 



Expression 

The full length SHEL gene was excised from pSHELB 
5 with BamHI and purified by gel electrophoresis. 200ng of 
the purified fragment was ligated with lOOng pGEX-2T 
linearized with BanMI and treated with calf alkaline 
phosphatase using the DNA Ligation Kit (Amersham 
International pic) according to the supplied protocol. 

10 Transf ormants were selected on LB-ampicillin plates and 
screened by restriction mapping. The SHEL gene cloned 
into pGEX-2T was designated pSHELC. 

Small scale expression of pSHELC was achieved by 
growing 5ml cultures of E.coli DH5a containing pSHELC in 

15 LB with 50/xg/ml ampicillin and 0.2% glucose at 37°C 
overnight. 250^1 was subinoculated into 5ml 2TY and 
grown to an A 600 of approximately 0.8 before being induced 
with ImM IPTG. Cultures were grown for a further 3 hours 
before harvesting. For the analysis of total cell 

20 protein 1ml culture was harvested by centrif ugation and 
resuspended in 200/xl SDS-PAGE loading buffer. 20/xl 
samples were boiled for 5 minutes before being analysed 
on an 8% SDS-PAGE gel. For the analysis of soluble and 
insoluble protein, the bacterial pellet from 3ml culture 

25 was resuspended in 500^1 lysis buffer (50mM Tris-HCl pH 
8, ImM EDTA, lOOmM NaCl) and lysed by the addition of 
lmg/ml- lysozyme at 4°C for 30 minutes- followed by 1% 
triton X-100 for 20 minutes. After the addition of 0.1 
mg/ml DNase samples were sonicated. The samples were 

3 0 centrif uged for 15 minutes in a microfuge and the pellet 
resuspended in an identical volume of lysis buffer as 
supernatant. 20/il samples of supernatant and resuspended 
pellet were boiled for 5 minutes and analysed by 8% SDS- 
PAGE. (Figure 7) . The calculated size of the protein 

35 from SDS-PAGE was 86kD which is in close agreement with 
the predicted size of 90kD. The protein was over 75% 
soluble under the conditions used. Total amino acid 
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content of the fusion protein was determined and the 
results show a high correlation with the predicted values 
(Figure 8) . The total level of expression was determined 
using SDS-PAGE and scanning densitometry and was found to 
5 be in excess of 100 mg/1 . 

After purification of GST away from SHEL a yield of 
up to 70 mg/1 could theoretically be obtained. 

Even allowing for losses during purification this is 
a highly significant improvement over 4 mg/1 obtained 

10 with cDNA clones (Indik et al 1990) . Optimising codon 
preference has therefore increased the potential yield of 
tropoelastin fifteenfold. 

Alternatively, the SHEL gene was excised from pSHELB 
with both Ncol and Bawill and purified as above. lOOng of 

15 the purified fragment was ligated to 50ng pET3d, 
previously digested with Ncol and BairHI , using the 
Amersham DNA Ligation Kit to give pSHELF. pSHELF was 
used to transform E.coli HMS174. After confirmation, 
pSHELF was extracted from HMS174 and used to transform 

20 BL21. In both cases, transf ormants were selected on LB- 
ampicillin plates and screened by restriction mapping. 

For pSHELF expression, 5ml LB containing 50/igml" 1 
ampicillin was inoculated with a single colony of E.coli 
BL21 (DE3) containing pSHELF and incubated overnight at 

25 37°C with shaking. 0.25ml of this culture was used to 
inoculate 5ml fresh LB containing 50/xgml" 1 ampicillin and 
grown to early log phase (A 600 =0 . 8 approx) . IPTG was 
added , to a. . final- concentration- of 0.4mM and - growth 
continued for a further 3h. Total cellular protein was 

3 0 -analysed as for pSHELC. Cell lysates were prepared by 
resuspension of the cell pellet in 9 volumes lysis buffer 
and incubation at 4°C for 30min with lmgml" 1 lysozyme. 
PMSF was added to 0 . 5mM before the mixture was twice 
frozen in liquid nitrogen and thawed at 37°C. DNase was 

35 added to a concentraton of 0. lmgml" 1 with lOmM MgCl 2 and 
incubated for 2 0min at room temperature or until the 
solution was no longer viscous. Insoluble material was 
removed by centrif ugation at 20 OOOrpm for 25min. 
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The soluble cell lysate from 125ml culture was 
extracted by use of a modified version of a technique 
previously described for tropoelastin isolation (Sandberg 
et a 1 . , 1971). 1.5 volumes of n-propanol was added to 
the lysate in five aliquots over 2 hours followed by 2.5 
volumes of n-butanol . All additions were performed at 
4°C with constant stirring and the mixture was allowed to 
extract overnight. The precipitated protein was removed 
by centrifugation for 15min at 10 OOOrpm. The soluble 
alcohol fraction was frozen and dried via a vacuum pump 
coupled to a liquid nitrogen trap. The residue was 
dissolved in 3.5ml 25mM HEPES pH 8.0 and dialyzed against 
1 1 of the same buffer for 2 hours, changed to fresh 
buffer and dialyzed overnight. The butanol precipitated 
protein was dissolved in an identical volume SDS-PAGE 
loading buffer and both fractions were analyzed by SDS- 
PAGE. 

The butanol-extracted protein containing SHEL was 
further purified by size fractionation using a Superose 
12 column and FPLC (Pharmacia) . Protein was eluted using 
25mM HEPES, pH 8.0. at a flow rate of 0.5 mlmin" 1 . 

Protein concentration was estimated using a Bradford 
assay (Ausubel et al . , 1989). 

Scanning densitometry of gels was performed on a 
Molecular Dynamics Personal Densitometer and analyzed 
using ImageQuant software. 

From SDS-PAGE the directly-expressed SHEL was 
calculated as being 64kDa (Eigure. . 10... ) which., is. as 
predicted. Total amino acid content was determined and 
was found to be in close agreement with predictions 
further confirming the nature of the overexpressed 
protein. The analysis (Figure 11) performed omits lysine 
residues . 

Scanning densitometry of gels was used to estimate 
the relative level of overexpression . SHEL was expressed 
at a level of approximately 17% total cell protein in the 
range 20-200kDa. This represents a substantial level of 
overexpression and confirms the value of codon 
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manipulation for high level expression. 

As a result of the high levels of expression large 
quantities of tropoelastin were obtained which can be 
used for further studies. The directly expressed SHEL 
protein appeared stable and the rapid degradation seen 
previously with cDNA expression (Indik et al . , 1990) was 
not observed. Therefore, the purification of the free 
polypeptide was pursued in preference to fusion protein. 
A technique utilizing tropoelastin ' s high solubility in 
short -chained alcohols has been used previously in the 
extraction and purification of tropoelastin from tissues 
(Sandberg et al., 1971). This method was modified for 
use with soluble cell lysates and found to be very 
effective. SHEL was selectively extracted into the 
alcohols while the majority of contaminating protein was 
precipitated and removed (Fig. 10) . The yield of SHEL 
after this step was high (greater than 90%) despite some 
loss (less than 10%) by precipitation. The resulting 
SHEL was of high purity as judged by SDS-PAGE after 
Coomassie staining (estimated by eye to be of the order 
greater than 80%) . A gel filtration step was used to 
remove the contaminating protein after which the SHEL was 
of sufficient purity for further characterization. 

Cross-linking of tropoelastin 

Tropoelastin obtained from PSHELF (0.3 mg/ml) was 
chemically cross-linked using lmM dithiobis 
(succinimidylpropionate). at 37°C to generate,. an...ins.olub-le 
material with elastin-like properties. Cross-linking was 
demonstrated by boiling in the presence of sodium dodecyl 
sulphate (SDS) followed by SDS-polyacryl amide gel 
electrophoresis. Cross-linked material did not enter the 
gel under conditions designed to allow entry of uncross - 
linked material. 

Industrial Applications 

Cosmetic Applications 

Recombinant tropoelastin is similar or identical to 
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material found in skin and other tissues and involves no 
animal death in order to make it. It adds to our own 
skin's supply of tropoelastin . Recombinant tropoelastins 
can be used in humans or animals. 
5 Additionally, methods such as liposome technology 

may be considered to deliver substances deep within the 
skin. 

Another significant area of use for tropoelastin is 
in minimising scar formation. The availability of large 
10 amounts of recombinant tropoelastin means that it should 
be possible to test whether the scarring obtained from 
severe cuts and burns can be minimised by regular 
application of tropoelastin to the affected area. 
Increased skin elasticity will counter the rigid effects 
15 of collagen buildup associated with scar formation, both 
in human and veterinary applications. 
Surgical and Veterinary Applications 

The tropoelastins and variants of this invention may 
be used in the repair and treatment of elastic and non- 
20 elastic tissues. They may also be used as food 
supplements . 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 





GATCCATGGG 


TGGCGTTCCG 


GGTGCTATCC 


CGGGTGGCGT 


TCCGGGTGGT 


GTATTCTACC 


60 




CAGGCGCGGG 


TCTGGGTGCA 


CTGGGCGGTG 


GTGCGCTGGG 


CCCGGGTGGT 


AAACCGCTGA 


120 




AACCGGTTCC 


AGGCGGTCTG 


GCAGGTGCTG 


GTCTGGGTGC 


AGGTCTGGGC 


GCGTTCCCGG 


180 


5 


CGGTTACCTT 


CCCGGGTGCT 


CTGGTTCCGG 


GTGGCGTTGC 


AGACGCAGCT 


GCTGCGTACA 


240 




AAGCGGCAAA 


GGCAGGTGCG 


GGTCTGGGCG 


GGGTACCAGG 


TGTTGGCGGT 


CTGGGTGTAT 


300 




CTGCTGGCGC 


AGTTGTTCCG 


CAGCCGGGTG 


CAGGTGTAAA 


ACCGGGCAAA 


GTTCCAGGTG 


360 




TTGGTCTGCC 


GGGCGTATAC 


CCGGGTGGTG 


TTCTGCCGGG 


CGCGCGTTTC 


CCAGGTGTTG 


420 




GTGTACTGCC 


GGGCGTTCCG 


ACCGGTGCAG 


GTGTTAAACC 


GAAGGCACCA 


GGTGTAGGCG 


480 


10 


GCGCGTTCGC 


GGGTATCCCG 


GGTGTTGGCC 


CGTTCGGTGG 


TCCGCAGCCA 


GGCGTTCCGC 


540 




TGGGTTACCC 


GATCAAAGCG 


CCGAAGCTTC 


CAGGTGGCTA 


CGGTCTGCCG 


TACACCACCG 


600 




GTAAACTGCC 


GTACGGCTAC 


GGTCCGGGTG 


GCGTAGCAGG 


TGCTGCGGGT 


AAAGCAGGCT 


660 




ACCCAACCGG 


TACTGGTGTT 


GGTCCGCAGG 


CTGCTGCGGC 


AGCTGCGGCG 


AAGGCAGCAG 


720 




CAAAATTCGG 


CGCGGGTGCA 


GCGGGTGTTC 


TGCCGGGCGT 


AGGTGGTGCT 


GGCGTTCCGG 


780 


15 


GTGTTCCAGG 


TGCGATCCCG 


GGCATCGGTG 


GTATCGCAGG 


CGTAGGTACT 


CCGGCGGCCG 


840 




CTGCGGCTGC 


GGCAGCTGCG 


GCGAAAGCAG 


CTAAATACGG 


TGCGGCAGCA 


GGCCTGGTTC 


900 




CGGGTGGTCC 


AGGCTTCGGT 


CCGGGTGTTG 


TAGGCGTTCC 


GGGTGCTGGT 


GTTCCGGGCG 


960 




TAGGTGTTCC 


AGGTGCGGGC 


ATCCCGGTTG 


TACCGGGTGC 


AGGTATCCCG 


GGCGCTGCGG 


1020 




TTCCAGGTGT 


TGTATCCCCG 


GAAGCGGCAG 


CTAAGGCTGC 


TGCGAAAGCT 


GCGAAATACG 


1080 


20 


GAGCTCGTCC 


GGGCGTTGGT 


GTTGGTGGCA 


TCCCGACCTA 


CGGTGTAGGT 


GCAGGCGGTT 


1140 




TCCCAGGTTT 


CGGCGTTGGT 


GTTGGTGGCA 


TCCCGGGTGT 


AGCTGGTGTT 


CCGTCTGTTG 


1200 




GTGGCGTACC 


GGGTGTTGGT 


GGCGTTCCAG 


GTGTAGGTAT 


CTCCCCGGAA 


GCGCAGGCAG 


1260 




CTGCGGCAGC 


TAAAGCAGCG 


AAGTACGGCG 


TTGGTACTCC 


GGCGGCAGCA 


GCTGCTAAAG 


1320 




CAGCGGCTAA 


AGCAGCGCAG 


TTCGGACTAG 


TTCCGGGCGT 


AGGTGTTGCG 


CCAGGTGTTG 


1380 


25 


GCGTAGCACC 


GGGTGTTGGT 


GTTGCTCCGG 


GCGTAGGTCT 


GGCACCGGGT 


GTTGGCGTTG 


1440 




CACCAGGTGT 


AGGTGTTGCG 


CCGGGCGTTG 


GTGTAGCACC 


GGGTATCGGT 


CCGGGTGGCG 


1500 




TTGCGGCTGC 


TGCGAAATCT 


GCTGCGAAGG 


TTGCTGCGAA 


AGCGCAGCTG 


CGTGCAGCAG 


1560 




CTGGTCTGGG 


TGCGGGCATC 


CCAGGTCTGG 


GTGTAGGTGT 


TGGTGTTCCG 


GGCCTGGGTG 


1620 




TAGGTGCAGG 


GGTACCGGGC 


CTGGGTGTTG 


GTGCAGGCGT 


TCCGGGTTTC 


GGTGCTGGCG 


1680 


30 


CGGACGAAGG 


TGTACGTCGT 


TCCCTGTCTC 


CAGAACTGCG 


TGAAGGTGAC 


CCGTCCTCTT 


1740 




CCCAGCACCT 


GCCGTCTACC 


CCGTCCTCTC 


CACGTGTTCC 


GGGCGCGCTG 


GCTGCTGCGA 


1800 




AAGCGGCGAA 


ATACGGTGCA 


GCGGTTCCGG 


GTGTACTGGG 


CGGTCTGGGT 


GCTCTGGGCG 


1860 




GTGTTGGTAT 


CCCGGGCGGT 


GTTGTAGGTG 


CAGGCCCAGC 


TGCAGCTGCT 


GCTGCGGCAA 


1920 




AGGCAGCGGC 


GAAAGCAGCT 


CAGTTCGGTC 


TGGTTGGTGC 


AGCAGGTCTG 


GGCGGTCTGG 


1980 


35 


GTGTTGGCGG 


TCTGGGTGTA 


CCGGGCGTTG 


GTGGTCTGGG 


TGGCATCCCG 


CCGGCGGCGG 


2040 




CAGCTAAAGC 


GGCTAAATAC 


GGTGCAGCAG 


GTCTGGGTGG 


CGTTCTGGGT 


GGTGCTGGTC 


2100 




AGTTCCCACT 


GGGCGGTGTA 


GCGGCACGTC 


CGGGTTTCGG 


TCTGTCCCCG 


ATCTTCCCAG 


2160 




GCGGTGCATG 


CCTGGGTAAA 


GCTTGCGGCC 


GTAAACGTAA 


ATAATGATAG 




2210 



VVU 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Ser Met Gly Gly Val Pro Gly Ala He Pro Gly Gly Val Pro Gly Gly 

15 10 15 

Val Phe Tyr Pro Gly Ala Gly Leu Gly Ala Leu Gly Gly Gly Ala Leu 

20 25 30 

Gly Pro Gly Gly Lys Pro Leu Lys Pro Val Pro Gly Gly Leu Ala Gly 

35 40 45 

Ala Gly Leu Gly Ala Gly Leu Gly Ala Phe Pro Ala Val Thr Phe Pro 

50 55 60 

Gly Ala Leu Val Pro Gly Gly Val Ala Asp Ala Ala Ala Ala Tyr Lys 
65 70 75 80 

Ala Ala Lys Ala Gly Ala Gly Leu Gly Gly Val Pro Gly Val Gly Gly 

85 90 95 

Leu Gly Val Ser Ala Gly Ala Val Val Pro Gin Pro Gly Ala Gly Val 

100 ■ 105 110 

Lys Pro Gly Lys Val Pro Gly Val Gly Leu Pro Gly Val Tyr Pro Gly 

115 120 125 

Gly Val Leu Pro Gly Ala Arg Phe Pro Gly Val Gly Val Leu Pro Gly 

130 135 140 

Val Pro Thr Gly Ala Gly Val Lys Pro Lys Ala Pro Gly Val Gly Gly 
145 150 155 160 

Ala Phe Ala Gly He Pro Gly Val Gly Pro Phe Gly Gly Pro Gin Pro 

165 170 175 

Gly Val Pro Leu Gly Tyr Pro He Lys Ala Pro Lys Leu Pro Gly Gly 

180 185 190 

Tyr Gly Leu Pro Tyr Thr Thr Gly Lys Leu Pro Tyr Gly Tyr Gly Pro 

195 200 205 

Gly Gly Val Ala Gly Ala Ala Gly Lys Ala Gly Tyr Pro Thr Gly Thr 

210 215 220 

Gly Val Gly Pro Gin Ala Ala Ala Ala Ala Ala Ala Lys Ala Ala Ala 
225 230 235 240 

Lys Phe Gly Ala Gly Ala Ala Gly Val Leu Pro Gly Val Gly Gly Ala 

245 250 255 

Gly Val Pro Gly Val Pro Gly Ala He Pro Gly He Gly Gly He Ala 

260 265 270 

Gly Val Gly Thr Pro Ala Ala Ala Ala Ala Ala Ala Ala Ala Ala Lys 

275 280 285 

Ala Ala Lys Tyr Gly Ala Ala Ala Gly Leu Val Pro Gly Gly Pro Gly 
290 295 300 
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Phe Gly Pro Gly Val Val Gly Val Pro Gly Ala Gly Val Pro Gly Val 
305 310 315. 320 

Gly Val Pro Gly Ala Gly He Pro Val Val Pro Gly Ala Gly He Pro 

325 330 335 

Gly Ala Ala Val Pro Gly Val Val Ser Pro Glu Ala Ala Ala Lys Ala 

340 345 350 

Ala Ala Lys Ala Ala Lys Tyr Gly Ala Arg Pro Gly Val Gly Val Gly 

355 360 365 

Gly He Pro Thr Tyr Gly Val Gly Ala Gly Gly Phe Pro Gly Phe Gly 

370 375 380 

Val Gly Val Gly Gly He Pro Gly Val Ala Gly Val Pro Ser Val Gly 
385 390 395 400 

Gly Val Pro Gly Val Gly Gly Val Pro Gly Val Gly He Ser Pro Glu 

405 410 415 

Ala Gin Ala Ala Ala Ala Ala Lys Ala Ala Lys Tyr Gly Val Gly Thr 

420 425 430 

Pro Ala Ala Ala Ala Ala Lys Ala Ala Ala Lys Ala Ala Gin Phe Gly 

435 440 445 

Leu Val Pro Gly Val Gly Val Ala Pro Gly Val Gly Val Ala Pro Gly 

450 455 460 

Val Gly Val Ala Pro Gly Val Gly Leu Ala Pro Gly Val Gly Val Ala 
465 470 475 480 

Pro Gly Val Gly Val Ala Pro Gly Val Gly Val Ala Pro Gly He Gly 

485 490 495 

Pro Gly Gly Val Ala Ala Ala Ala Lys Ser Ala Ala Lys Val Ala ,Ala 

500 505 510 

Lys Ala Gin Leu Arg Ala Ala Ala Gly Leu Gly Ala Gly He Pro Gly 

515 520 525 

Leu Gly Val Gly Val Gly Val Pro Gly Leu Gly Val Gly Ala Gly Val 

530 535 540 

Pro Gly Leu Gly Val Gly Ala Gly Val Pro Gly Phe Gly Ala Gly Ala 
545 550 555 560 

Asp Glu Gly Val Arg Arg Ser Leu Ser Pro Glu Leu Arg Glu Gly Asp 

565 570 575 

Pro Ser Ser Ser Gin His Leu Pro Ser Thr Pro Ser Ser Pro Arg Val 

580 585 590 

Pro Gly Ala Leu Ala Ala Ala Lys Ala Ala Lys Tyr Gly Ala Ala Val 

595 600 605 

Pro Gly Val Leu Gly Gly Leu Gly Ala Leu Gly Gly Val Gly He Pro 

610 615 620 

Gly Gly Val Val Gly Ala Gly Pro Ala Ala Ala Ala Ala Ala Ala Lys 
625 630 635 640 

Ala Ala Ala Lys Ala Ala Gin Phe Gly Leu Val Gly Ala Ala Gly Leu 

645 650 655 

Gly Gly Leu Gly Val Gly Gly Leu Gly Val Pro Gly Val Gly Gly Leu 
660 665 670 
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Gly Gly lie Pro Pro Ala Ala Ala Ala Lys Ala Ala Lys Tyr Gly Ala 

675 680 685 

Ala Gly Leu Gly Gly Val Leu Gly . Gly Ala Gly Gin Phe Pro Leu Gly 

690 695' 700 

Gly Val Ala Ala Arg Pro Gly Phe Gly Leu Ser Pro lie Phe Pro Gly 
705 710 715 720 

Gly Ala Cys Leu Gly Lys Ala Cys Gly Arg Lys Arg Lys 
725 730 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL : YES 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
GATCCATGGG TGGCGTTCCG GGTGCTATCC CGGGTGGCGT TCCGGGTGGT GTATTCTACC 6 0 
CAGGCGCGGG TCTGGGTGCA CTGGGCGGTG 90 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
GTGCGCTGGG CCCGGGTGGT AAACCGCTGA AACCGGTTCC AGGCGGTCTG GCAGGTGCTG 60 
GTCTGGGTGC AGGTCTGGGC GCGTTCCCGG 90 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
CGGTTACCTT CCCGGGTGCT CTGGTTCCGG GTGGCGTTGC AGACGCAGCT GCTGCGTACA 60 
AAGCGGCAAA GGCAGGTGCG GGTCTGGGCG GGGTAC 96 
(2) INFORMATION FOR SEQ ID NO: 6: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:€: 
CAGGTGTTGG CGGTCTGGGT GTATCTGCTG GCGCAGTTGT TCCGCAGCCG GGTGCAGGTG 
TAAAACCGGG CAAAGTTCCA GGTGTTGGTC TGCCGGGCG 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
TATACCCGGG TGGTGTTCTG CCGGGCGCGC GTTTCCCAGG TGTTGGTGTA CTGCCGGGCG 
TTCCGACCGG TGCAGGTGTT AAACCGAAGG 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
CACCAGGTGT AGGCGGCGCG TTCGCGGGTA TCCCGGGTGT TGGCCCGTTC GGTGGTCCGC 
AGCCAGGCGT TCCGCTGGGT TACCCGATCA AAGCGCCGA 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
AGCTTCCAGG TGGCTACGGT CTG CCGTACA CCACCGGTAA ACTGCCGTAC GGCTACGGTC 
CGGGTGGCGT AGCAGGTGCT GCGGGTAA 
(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 
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(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
AGCAGGCTAC CCAACCGGTA CTGGTGTTGG TCCGCAGGCT GCTGCGGCAG CTGCGGCGAA 
GGCAGCAGCA AAATTCGGCG CGGGTGCAGC 
(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 
GGGTGTTCTG CCGGGCGTAG GTGGTGCTGG CGTTCCGGGT GTTCCAGGTG CGATCCCGGG 
. CATCGGTGGT ATCGCAGGCG TAGGTACTCC GGC 
(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GGCCGCTGCG GCTGCGGCAG CTGCGGCGAA AGCAGCTAAA TACGGTGCGG CAGCAGGCCT 
GGTTCCGGGT GGTCCAGGCT TCGGT 
(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CCGGGTGTTG TAGGCGTTCC GGGTGCTGGT GTTCCGGGCG TAGGTGTTCC AGGTGCGGGC 
ATCCCGGTTG TACCGGGTGC AGGTA 
(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
TCCCGGGCGC TGCGGTTCCA- GGTGTTGTAT CCCCGGAAGC GGCAGCTAAG GCTGCTGCGA 60 
AAGCTGCGAA ATACGGAGCT 80 
(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
CGTCCGGGCG TTGGTGTTGG TGGCATCCCG ACCTACGGTG TAGGTGCAGG CGGTTTCCCA 60 
GGTTTCGGCG TTGGTGTTGG TGGCATCCCG GG 92 
(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TGTAGCTGGT GTTCCGTCTG TTGGTGGCGT ACCGGGTGTT GGTGGCGTTC CAGGTGTAGG 6 0 
TATCTCCCCG GAAGCGCAGG CAGCTGCGGC 90 
(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AG CTAAAGC A GCGAAGTACG GCGTTGGTAC TCCGGCGGCA GCAGCTGCTA AAGCAGCGGC 60 
TAAAGCAGCG CAGTTCGGA 79 
(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTAGTTCCGG GCGTAGGTGT TGCGCCAGGT GTTGGCGTAG CACCGGGTGT TGGTGTTGCT 60 
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CCGGGCGTAG GTCTGGCACC GGGTGTTGGC GTTG 94 
(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CACCAGGTGT AGGTGTTGCG CCGGGCGTTG GTGTAGCACC GGGTATCGGT CCGGGTGGCG 60 
TTGCGGCTGC TGCGAAATCT GCTGCGAAGG TTGCT 95 
(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
GCGAAAGCGC AGCTGCGTGC AGCAGCTGGT CTGGGTG CGG GCATCCCAGG TCTGGGTGTA 6 0 
GGTGTTGGTG TTCCGGGCCT GGGTGTAGGT GCAGGGGTAC 10 0 

(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 
CGGGCCTGGG TGTTGGTGCA GGCGTTCCGG GTTTCGGTGC TGGCGCGGAC GAAGGTGTAC 60 
GTCGTTCCCT GTCTCCAGAA CTGCGT 86 
(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GAAGGTGACC CGTCCTCTTC CCAGCACCTG CCGTCTACCC CGTCCTCTCC ACGTGTTCCG 6 0 
GGCGCGCTGG CTGCTGCGAA AGCGGCGAAA TAC 93 
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(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
GGTGCAGCGG TTGCGGGTGT ACTGGGCGGT CTGGGTGCTC TGGGCGGTGT TGGTATCCCG 
GGCGGTGTTG TAGGTGCAGG CCCAGCTGCA 
(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GCTGCTGCTG CGGCAAAGGC AGCGGCGAAA GCAGCTCAGT TCGGTCTGGT TGGTGCAGCA 
GGTCTGGGCG GTCTGGGTGT TGGCGGTC 
(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
TGGGTGTACC GGGCGTTGGT GGTCTGGGTG GCATCCCGCC GGCGGCGGCA GCTAAAGCGG 
CTAAATACGG TGCAGCAGGT CTGGGTGGCG TTCTGGGT 
(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
GGTGCTGGTC AGTTCCCACT GGGCGGTGTA GCGGCACGTC CGGGTTTCGG TCTGTCCCCG 
ATCTTCCCAG GCGGTGCATG CCTGGGTAA 
(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
AGCTTGCGGC CGTAAACGTA AATAATGATA G 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
GCGCACCACC GCCCAGTGCA CCCAGACCCG CGCCTGGGTA GAATACACCA CCCGGAACGC 
CACCCGGGAT AGCACCCGGA ACGCCACCCA TG 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
TAACCGCCGG GAACGCGCCC AGACCTGCAC CCAGACCAGC ACCTGCCAGA CCGCCTGGAA 
CCGGTTTCAG CGGTTTACCA CCCGGGCCCA 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
CCCGCCCAGA CCCGCACCTG CCTTTGCCGC TTTGTACGCA GCAGCTGCGT CTGCAACGCC 
ACCCGGAACC AGAGCACCCG GGAAGG 
(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
CGGCAGACCA ACACCTGGAA CTTTGCCCGG TTTTACACCT GCACCCGGCT GCGGAACAAC 
TGCGCCAGCA GATACACCCA GACCGCCAAC ACCTGGTAC 
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<2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 99 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

<iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
TGGTGCCTTC GGTTTAACAC CTGCACCGGT CGGAACGCCC GGCAGTACAC CAACACCTGG 
GAAACGCGCG CCCGGCAGAA CACCACCCGG GTATACGCC 
(2) INFORMATION FOR SEQ ID NO : 6 : 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
AGCTTCGGCG CTTTGATCGG GTAACCCAGC GGAACGCCTG GCTGCGGACC ACCGAACGGG 
CCAACACCCG GGATACCCGC GAACGCGCCG CCTACACC 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
CCTGCTTTAC CCGCAGCACC TGCTACGCCA CCCGGACCGT AGCCGTACGG CAGTTTACCG 
GTGGTGTACG GCAGACCGTA GCCACCTGGA 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
ACACCCGCTG CACCCGCGCC GAATTTTGCT GCTGCCTTCG CCGCAGCTGC CGCAGCAGCC 
TGCGGACCAA CACCAGTACC GGTTGGGTAG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
GGCCGCCGGA GTACCTACGC CTGCGATACC ACCGATGCCC GGGATCGCAC CTGGAACACC 
CGGAACGCCA GCACCACCTA CGCCCGGCAG A 
(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GCCTGGACCA CCCGGAACCA GGCCTGCTGC CGCACCGTAT TTAGCTGCTT TCGCCGCAGC 
TGCCGCAGCC GCAGC 

(2) INFORMATION FOR SEQ ID NO: 11: - 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CACCCGGTAC AACCGGGATG CCCGCACCTG GAACACCTAC GCCCGGAACA CCAGCACCCG 
GAACGCCTAC AACACCCGGA CCGAA 
(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CCGTATTTCG CAGCTTTCGC AGCAGCCTTA GCTGCCGCTT CCGGGGATAC AACACCTGGA 
ACCGCAGCGC CCGGGATACC TG 
(2) INFORMATION FOR SEQ ID NO: 13: 
( i ) SEQUENCE . CHARACTERISTICS : 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
ATGCCACCAA CACCAACGCC GAAACCTGGG AAACCGCCTG CACCTACACC GTAGGTCGGG 
ATGCCACCAA CACCAACGCC CGGACGAGCT 
(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
GCTGCCTGCG CTTCCGGGGA GATACCTACA CCTGGAACGC CACCAACACC CGGTACGCCA 
CCAACAGACG GAACACCAGC TACACCCGGG 
(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CTAGTCCGAA CTGCGCTGCT TTAGCCGCTG CTTTAGCAGC TGCTGCCGCC GGAGTACCAA 
CGCCGTACTT CGCTGCTTTA GCTGCCGCA 
(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
CTGGTGCAAC GCCAACACCC GGTGCCAGAC CTACGCCCGG AGCAACACCA ACACCCGGTG 
CTACGCCAAC ACCTGGCGCA ACACCTACGC CCGGAA 
(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(iii) HYPOTHETICAL: YES 
(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TTTCGCAGCA ACCTTCGCAG CAGATTTCGC AGCAGCCGCA ACGCCACCCG GACCGATACC 
CGGTGCTACA CCAACGCCCG GCGCAACACC TACAC 
(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CCCTGCACCT ACACCCAGGC CCGGAACACC AACACCTACA CCCAGACCTG GGATGCCCGC 
ACCCAGACCA GCTGCTGCAC GCAGCTGCGC 
(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
ACCTTCACGC AGTTCTGGAG ACAGGGAACG ACGTACACCT TCGTCCGCGC CAGCACCGAA 
ACCCGGAACG CCTGCACCAA CACCCAGGCC CGGTAC 
(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
CGCCGCTTTC GCAGCAGCCA GCGCGCCCGG AACACGTGGA GAGGACGGGG TAGACGGCAG 
GTGCTGGGAA GAGGACGGGT C 
(2) INFORMATION FOR SEQ ID NO: 21: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GCTGGGCCTG CACCTACAAC ACCGCCCGGG ATACCAACAC CGCCCAGAGC ACCCAGACCG 
CCCAGTACAC CCGGAACCGC TGCACCGTAT TT 
(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
CACCCAGACC GCCAACACCC AGACCGCCCA GACCTGCTGC ACCAACCAGA CCGAACTGAG 
CTGCTTTCGC CGCTGCCTTT GCCGCAGCAG CAGCTGCA 
(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: YES 
(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
AACGCCACCC AGACCTGCTG CACCGTATTT AGCCG CTTTA GCTGCCGCCG CCGGCGGGAT 
GCCACCCAGA CCACCAACGC CCGGTA 
(2) INFORMATION FOR SEQ ID NO: 24: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
AGCTTTACCC AGGCATGCAC CGCCTGGGAA GATCGGGGAC AGACCGAAAC CCGGACGTGC 
CGCTACACCG CCCAGTGGGA ACTGACCAGC ACCACCCAG 
(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 
GATCCTATCA TTATTTACGT TTACGGCCGC A 



(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2210 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

10 <xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GATCCATGGG AGGGGTCCCT GGGGCCATTC CTGGTGGAGT TCCTGGAGGA GTCTTTTATC 60 
CAGGGGCTGG TCTCGGAGCC CTTGGAGGAG GAGCGCTGGG GCCTGGAGGC AAACCTCTTA 120 
AGCCAGTTCC CGGAGGGCTT GCGGGTGCTG GCCTTGGGGC AGGGCTCGGC GCCTTCCCCG 180 
CAGTTACCTT TCCGGGGGCT CTGGTGCCTG GTGGAGTGGC TGACGCTGCT GCAGCCTATA 240 

15 AAGCTGCTAA GGCTGGCGCT GGGCTTGGTG GTGTCCCAGG AGTTGGTGGC TTAGGAGTGT 300 
CTGCAGGTGC GGTGGTTCCT CAGCCTGGAG CCGGAGTGAA GCCTGGGAAA GTGCCGGGTG 36 0 
TGGGGCTGCC AGGTGTATAC CCAGGTGGCG TGCTCCCAGG AGCTCGGTTC CCCGGTGTGG 42 0 
GGGTGCTCCC TGGAGTTCCC ACTGGAGCAG GAGTTAAGCC CAAGGCTCCA GGTGTAGGTG 480 
GAGCTTTTGC TGGAATCCCA GGAGTTGGAC CCTTTGGGGG ACCGCAACCT GGAGTCCCAC 540 

20 TGGGGTATCC CATCAAGGCC CCCAAGCTGC CTGGTGGCTA TGGACTGCCC TACACCACAG 600 
GGAAACTGCC CTATGGCTAT GGGCCCGGAG GAGTGGCTGG TGCAGCGGGC AAGGCTGGTT 660 
ACCCAACAGG GACAGGGGTT GGCCCCCAGG CAGCAGCAGC AGCGGCAGCT AAAGCAGCAG 720 
CAAAGTTCGG TGCTGGAGCA GCCGGAGTCC TCCCTGGTGT TGGAGGGGCT GGTGTTCCTG 780 
GCGTGCCTGG GGCAATTCCT GGAATTGGAG GCATCGCAGG CGTTGGGACT CCAGCTGCAG 840 

25 CTGCAGCTGC AGCAGCAGCC GCTAAGGCAG CCAAGTATGG AGCTGCTGCA GGCTTAGTGC 900 
CTGGTGGGCC AGGCTTTGGC CCGGGAGTAG TTGGTGTCCC AGGAGCTGGC GTTCCAGGTG 960 
TTGGTGTCCC AGGAGCTGGG ATT CCAGTTG TCCCAGGTGC TGGGATCCCA GGTGCTGCGG 1020 
TTCCAGGGGT TGTGTCACCA GAAGCAGCTG CTAAGGCAGC TGCAAAGGCA GCCAAATACG 1080 
GGGCCAGGCC CGGAGTCGGA GTTGGAGGCA TTCCTACTTA CGGGGTTGGA GCTGGGGGCT 1140 

30 TTCCCGGCTT TGGTGTCGGA GTCGGAGGTA TCCCTGGAGT CGCAGGTGTC CCTAGTGTCG 12 00 
GAGGTGTTCC CGGAGTCGGA GGTGTCCCGG G AGTTGG CAT TTCCCCCGAA GCTCAGGCAG 1260 
CAGCTGCCGC CAAGGCTGCC AAGTACGGAG TGGGGACCCC AGCAGCTGCA GCTGCTAAAG 1320 
CAGCCGCCAA AGCCGCCCAG TTTGGGTTAG TTCCTGGTGT CGGCGTGGCT CCTGGAGTTG 1380 
GCGTGGCTCC TGGTGTCGGT GTGGCTCCTG GAGTTGGCTT GGCTCCTGGA GTTGGCGTGG 1440 

35 CTCCTGGAGT TGGTGTGGCT CCTGGCGTTG GCGTGGCTCC CGGCATTGGC CCTGGTGGAG 1500 
TTGCAGCTGC AGCAAAATCC GCTGCCAAGG TGGCTGCCAA AGCCCAGCTC CGAGCTGCAG 1560 
CTGGGCTTGG TGCTGGCATC CCTGGACTTG GAGTTGGTGT CGGCGTCCCT GGACTTGGAG 162 0 
. TTGGTGCTGG TGTTCCTGGA CTTGGAGTTG GTGCTGGTGT TCCTGGCTTC GGGGCAGGTG 1680 
CAGATGAGGG AGTTAGGCGG AGCCTGTCCC CTGAGCTCAG GGAAGGAGAT CCCTCCTCCT 1740 

4 0 CTCAGCACCT CCCCAGCACC CCCTCATCAC CCAGGGTACC TGGAGCCCTG GCTGCCGCTA 1800 
AAGCAGCCAA ATATGGAGCA GCAGTGCCTG GGGTCCTTGG AGGGCTCGGG GCTCTCGGTG 1860 
GAGTAGGCAT CCCAGGCGGT GTGGTGGGAG CCGGACCCGC CGCCGCCGCT GCCGCAGCCA 1920 
AAGCTGCTGC CAAAGCCGCC CAGTTTGGCC TAGTGGGAGC CGCTGGGCTC GGAGGACTCG 1980 
GAGTCGGAGG GCTTGGAGTT CCAGGTGTTG GGGGCCTTGG AGGTATACCT CCAGCTGCAG 2040 

45 CCGCTAAAGC AGCTAAATAC GGTGCTGCTG GCCTTGGAGG TGTCCTAGGG GGTGCCGGGC 2100 
AGTTCCCACT TGGAGGAGTG GCAGCAAGAC CTGGCTTCGG ATTGTCTCCC ATTTTCCCAG 2160 



GTGGGGCCTG CCTGGGGAAA GCTTGTGGCC GGAAGAGAAA ATGATGATAG 2210 
(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4045 base pairs 
5 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

10 (iv) ANTI- SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 
TTCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC CCTGGCGTTA CCCAACTTAA 60 
TCGCCTTGCA GCACATCCCC CTTTCGCCAG CTGGCGTAAT AG CG AAGAGG CCCGCACCGA 12 0 
TCGCCCTTCC CAACAGTTGC GCAGCCTGAA TGGCGAATGG CGCCTGATGC GGTATTTTCT 180 

15 CCTTACGCAT CTGTGCGGTA TTTCACACCG CATATGGTGC ACTCTCAGTA CAATCTGCTC 24 0 
TGATGCCGCA TAGTTAAGCC AGCCCCGACA CCCGCCAACA CCCGCTGACG CGCCCTGACG 300 
GGCTTGTCTG CTCCCGGCAT CCGCTTACAG ACAAGCTGTG ACCGTCTCCG GGAGCTGCAT 3 SO 
GTGTCAGAGG TTTTCAC CGT CATCACCGAA ACGCGCGAGA CGAAAGGGCC TCGTGATACG 420 
CCTATTTTTA TAGGTTAATG TCATGATAAT AATGGTTTCT TAGACGTCAG GTGGCACTTT 4 80 

2 0 TCGGGGAAAT GTGCGCGGAA CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA 540 
TCCGCTCATG AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGTAT 6 00 
GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT GCCTTCCTGT 660 
TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT GAAGATCAGT TGGGTGCACG 720 
AGTGGGTTAC ATCGAACTGG ATCTCAACAG CGGTAAGATC CTTGAGAGTT TTCGCCCCGA 780 

2 5 AGAACGTTTT CCAATGATGA GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG 840 

TATTGACGCC GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT 900 
TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA GAGAATTATG 960 
CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC TTACTTCTGA CAACGATCGG 1020 
AGGACCGAAG GAGCTAACCG CTTTTTTGCA CAACATGGGG GATCATGTAA CTCGCCTTGA 1080 

3 0 TCGTTGGGAA CCGGAGCTGA ATGAAGCCAT AC C AAACGAC GAGCGTGACA CCACGATGCC 1140 

TGTAGCAATG GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC 12 00 
CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC TTCTGCGCTC 1260 
GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA GCCGGTGAGC GTGGGTCTCG 1320 
CGGTATCATT GCAGCACTGG GGCCAGATGG TAAGCCCTCC CGTATCGTAG TTATCTACAC 13 80 

35 GACGGGGAGT CAGGCAACTA TGGATGAACG AAATAGACAG ATCGCTGAGA TAGGTGCCTC 1440 
ACTGATTAAG CATTGGTAAC TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT 1500 
AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA ATCTCATGAC 1560 
CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA GACCCCGTAG AAAAGATCAA 162 0 
AGGATCTTCT TGAGATCCTT TTTTTCTGCG CGTAATCTGC TGCTTGCAAA CAAAAAAACC 1680 

40 ACCGCTACCA GCGGTGGTTT GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT 174 0 
AACTGGCTTC AGCAGAGCGC AGATACCAAA TACTGTTCTT CTAGTGTAGC CGTAGTTAGG 1800 
CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA TCCTGTTACC 1860 
AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG TTGGACTCAA GACGATAGTT 192 0 
ACCGGATAAG GCGCAGCGGT CGGGCTGAAC GGGGGGTTCG TGCACACAGC CCAGCTTGGA 1980 

45 GCGAACGACC TACACCGAAC TGAGATACCT ACAGCGTGAG CATTGAGAAA GCGCCACGCT 2 04 0 
TCCCGAAGGG AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG 2100 



CACGAGGGAG CTTCCAGGGG GAAACGCCTG 
CCTCTGACTT GAGCGTCGAT TTTTGTGATG 
CGCCAGCAAC GCGGCCTTTT TACGGTTCCT 
CTTTCCTGCG TTATCCCCTG ATTCTGTGGA 
5 TACCGCTCGC CGCAGCCGAA CGACCGAGCG 
GCGCCCAATA CGCAAACCGC CTCTCCCCGC 
CGACAGGTTT CCCGACTGGA AAGCGGGCAG 
CACTCATTAG GCACCCCAGG CTTTACACTT 
TGTGAGCGGA TAACAATTTC ACACAGGAAA 

10 CTGCAGGTGA TGATTATCAG CCAGCAGAGA 
TTATCTTTCC CTTTATTTTT GCTGCGGTAA 
TCCATTTACT ATGTTATGTT CTGAGGGGAG 
TGCTCAATTG TTATCAGCTA TGCGCCGACC 
TTCAGGCCAC TGACTAGCGA TAACTTTCCC 

15 CATTGGGTAC TGTGGGTTTA GTGGTTGTAA 
CTTGAAGGTA AACTCATCAC CCCCAAGTCT 
CTGCTCAGGG TCAACGAGAA TTAACATTCC 
TGCGGTCATG GAATTACCTT CAACCTCAAG 
TGTGCTTACC CATCTCTCCG CATCACCTTT 

2 0 CCCTGCCTGA ACATGAGAAA AAACAGGGTA 

ACTAACCGCT TCATACATCT CGTAGATTTC 
GCTAACTTTG AGAATTTTTG CAAGCAATGC 
ATTAAATAAA GCACCAACGC CTGACTGCCC 
GGATAAGCCA AGTTCATTTT TCTTTTTTTC 
25 AAGCTGCTCT TGTGTTAATG GTTTCTTTTT 
GGGATAAATA TCTAACACCG TGCGTGTTGA 
CATGTACTAA GGAGGTTGTA TGGAACAACG 
TGGGCAAACC AAGACAGCTA AAGATCTCTC 
TAAATTCATA TAAAAAACAT ACAGATAACC 

3 0 TTGACATAAA TACCACTGGC GGTGATACTG 

GAAGGTGACG CTCTTAAAAA TTAAGCCCTG 
TGGGGTGTGT GATACGAAAC G AAG CATTGG 
TAAACCTCCT TAGGATCCCC GGGAA 
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GTATCTTTAT AGTCCTGTCG GGTTTCGCCA 2160 
CTCGTCAGGG GGGCGGAGCC TATGGAAAAA 2220 
GGCCTTTTGC TGGCCTTTTG CTCACATGTT 2280 
TAACCGTATT ACCGCCTTTG AGTGAGCTGA 234 0 
CAGCGAGTCA GTGAGCGAGG AAGCGGAAGA 2400 
GCGTTGGCCG ATT C ATTAAT GCAGCTGGCA 2460 
TGAGCGCAAC GCAATTAATG TGAGTTAGCT 2520 
TATGCTTCCG GCTCGTATGT TGTGTGGAAT 2580 
C AG CTATG AC CATGATTACG CCAAGCTTGG 264 0 
TTAAGGAAAA CAGACAGGTT TATTGAGCGC 2700 
GTCGCATAAA AACCATTCTT CATAATTCAA 2 760 
TGAAAATTCC CCTAATTCGA TGAAGATTCT 282 0 
AGAACACCTT GCCGATCAGC CAAACGTCTC 288 0 
CACAACGGAA CAACTCTCAT TG CATGGG AT 2 94 0 
AAACACCTGA CCGCTATCCC TGATCAGTTT 3000 
GGCTATGCAG AAATCACCTG GCTCAACAGC 306 0 
GTCAGGAAAG CTTGGCTTGG AGCCTGTTGG 312 0 
CCAGAATGCA GAATCACTGG CTTTTTTGGT 318 0 
GGTAAAGGTT CTAAGCTTAG GTGAGAACAT 324 0 
CTCATACTCA CTTCTAAGTG ACGGCTGCAT 3300 
TCTGGCGATT GAAGGGCTAA ATTCTTCAAC 336 0 
GGCGTTATAA GCATTTAATG CATTGATGCC 342 0 
CATCCCCATC TTGTCTGCGA CAGATTCCTG 348 0 
ATAAATTGCT TTAAGGCGAC GTGCGTCCTC 354 0 
TGTGCTCATA CGTTAAATCT ATCACCGCAA 3600 
CTATTTTACC TCTGGCGGTG ATAATGGTTG 366 0 
CATAACCCTG AAAGATTATG CAATGCGCTT 372 0 
ACCTACCAAA CAATGCCCCC CTGCAAAAAA 3 78 0 
ATCTGCGGTG ATAAATTATC TCTGGCGGTG 384 0 
AGCACATCAG CAGGACGCAC TGACCACCAT 3900 
AAGAAGGGCA GCATTCAAAG CAGAAGGCTT 3960 
GATCCTAAGG AGGTTTAAGA TCCATGGGTT 402 0 
4045 
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CLAIMS 

1. A synthetic polynucleotide encoding the amino 
acid sequence of a tropoelastin or a variant of the 
tropoelastin wherein: 

5 all or some of the codons which hamper expression in 

the expression system in which the polynucleotide is 
to be expressed are replaced with codons more 
favourable for expression in the expression system. 

2 . A synthetic polynucleotide according to claim l 
10 wherein at least 50% of codons for any particular amino 

acid are selected to reflect preferred codon usage in the 
host of choice. 

3 . A synthetic polynucleotide according to claim 1 
or claim 2 wherein the synthetic polynucleotide excludes 

15 all or part of the 5 1 and/or 3' untranslated regions of 
the tropoelastin gene corresponding to the synthetic 
polynucleotide . 

4 . A synthetic polynucleotide according to claim 1 
or claim 2 wherein the synthetic polynucleotide excludes 

20 all or part of the tropoelastin signal peptide encoding 
sequence of the corresponding tropoelastin gene. 

5. A synthetic polynucleotide according to claim 1 
or claim 2 wherein the synthetic polynucleotide is 
prepared from assembled oligonucleotides incorporating 

25 restrition sites to facilitate assembly of the 

polynucleotide . 

6., . A...syn-thet.ic.,.pol.yau,cl£otide- according -to claim 1 

or claim 2 wherein the expression system is an E. coli 

expression system or a yeast, or other bacterial 
3 0 expression system or an insect or other eukaryotic cell 

expression system or a whole organism. 

7. A synthetic polynucleotide according to claim 6 
wherein the expression system is E. coli and at least 
50% of the base changes indicated in Figure 6 have been 

35 made. 

8. A synthetic polynucleotide according to claim l 
or claim 2 comprising the sequence depicted in Figure 
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3 (1) to 3(5) (SEQ ID NO: 1) . 

9. A synthetic polynucleotide according to claim 1 
or claim 2 fused to a polynucleotide sequence compatible 
with the host for the expression system... 
5 10. A synthetic polynucleotide according to claim 9 

where the compatible sequence is at the 5 * end of the 
polynucleotide molecule. 

11. A synthetic polynucleotide according to claim 
10 wherein the compatible polynucleotide encodes all or 

10 part of glutathione-S-transf erase . 

12 . A recombinant DNA molecule comprising a 
synthetic polynucleotide according to claim 1 or claim 2 
and vector DNA. 

13 . A recombinant DNA molecule according to claim 
15 12 wherein the vector is selected from the group 

consisting of pBR322, pBluescript II SK + , pGEX-2T, 
pTrc99A ; pET3d and derivatives of these vectors. 

14 . A plasmid selected from the group consisting of 
pSHELA, pSHELB, pSHELC and pSHELF . 

20 15 . A host transformed with a recombinant DNA 

molecule according to claim 12 or claim 13 or a plasmid 
according to claim 14 . 

16. A host according to claim 15 which host is a 
bacterium, a yeast, an insect cell or other eukaryotic 

25 cell, or a whole organism. 

17. A host according to claim 16 which is E. coli 
strain NM522 or XLl-Blue. 

18. An... expression- product..- of. -a, . host., .according- to. 

claim 15, which expression product comprises a 
30 tropoelastin or tropoelastin variant. 

19 . An expression product according to claim 18 
which is SHEL or GST-SHEL. 

20. A cross-linked expression product according to 
claim 18. 

35 21. A cross-linked expression product according to 

claim 20 which is chemically cross-linked. 

22. A cross-linked expression product according to 
claim 20 which is enzymatically cross-linked. 
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23. A cross-linked expression product according to 
claim 20 which is cross -linked by gamma irradiation. 

24. A composition comprising an expression product 
according to claim 18 or a cross-linked expression 

5 product according to claim 20 together with a 
pharmaceutical^ or veterinarally acceptable carrier. 

25. A carrier for delivery of an active agent 
comprising a coacervate of an expression product 
according to claim 18. 

10 26. A process for the preparation of an expression 

product according to claim 18 comprising: 

providing a transformed host according to claim 16; 
culturing it under conditions suitable for expression of 
the expression product; and collecting the expression 

15 product . 
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FIG. 1 
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1 GATCCATGGGTGGCGTTCCGGGTGCTATCCCGGGTGGCGTTCCGGGTGGTGTATTCTACC 60 
GTACCCACCGCAAGGCCCACGATAGGGCCCACCGCAAGGCCCACCACATAAGATGG 

SMGGVPGAIPGGVPGGVFYP 



6 1 CAGGCGCGGGTCTGGGTGCACTGGGCGGTGGTGCGCTGGGCCCGGGTGGTAAACCGCTGA 12 0 
GTCCGCGCCCAGACCCACGTGACCCGCCACCACGCGACCCGGGCCCACCATTTGGCGACT 

GAGLGALGGGA LGPGGKPLK 



121 AACCGGTTCCAGGCGGTCTGGCAGGTGCTGGTCTGGGTGCAGGTCTGGGCGCGTTCCCGG 
TTGGCCAAGGTCCGCCAGACCGTCCACGACCAGACCCACGTCCAGACCCGCGCAAGGGCC 



181 CGGTTACCTTCCCGGGTGCTCTGGTTCCGGGTGGCGTTGCAGACGCAGCTGCTGCGTACA 
GCCAATGGAAGGGCCCACGAGACCAAGGCCCACCGCAACGTCTGCGTCGACGACGCATGT 



241 AAGCGGCAAAGGCAGGTGCGGGTCTGGGCGGGGTACCAGGTGTTGGCGGTCTGGGTGTAT 300 
TTCGCCGTTTCCGTCCACGCCCAGACCCGCCCCATGGTCCACAACCGCCAGACCCACATA 

AAKAGAGLG GVPGVGGLGVS 



301 CTGCTGGCGCA^rTTGTTCCGCAGCCGGGTGCAGGTGTAAAACCGGGCAAAGTTCCAGGTG"" 360 
GACGACCGCGTCAACAAGGCGTCGGCCCACGTCCACATTTTGGCCCGTTTCAAGGTCCAC 

AGAVVPQPGAGVKPGK VPGV 



"361 TTGGTCTGCCGGGCGTATACCCGGGTGGTGTTCTGCCGGGCGCGCGTTTCCC AGGTGTTG 4 20 
AACCAGACGGCCCGCATATGGGCCCACCACAAGACGGCCCGCGCGCAAAGGGTCCACAAC 

GLPGVYPGGVLPGARFPGVG 



FIG. 3(1) 



f V^X //1U7J/UUUJJ 



421 GTGTACTGCCGGGCGTTCCG ACCGGTGCAGGTGTTAAACCGAAGGCACCAGGTGTAGGCG 4 80 
CACATGACGGCCCGCAAGGCTGGCCACGTCCACAATTTGGCTTCCGTGGTCCACATCCGC 

VLPGVP TGAGVKPKAPGVGG 



481 GCGCGTTCGCGGGTATCCCGGGTGTTGGCCCGTTCGGTGGTCCGCAGCCAGGCGTTCCGC 54 0 
CGCGCAAGCGCCCATAGGGCCCACAACCGGGCAAGCCACCAGGCGTCGGTCCGCAAGGCG 

AFAGIPGVGPFGGPQPGVPL 



TGGGTTACCCGATCAAAGCGCCGAAGCTTCCAGGTGGCTACGGTCTGCCGTACACCACCG 
ACCCAATGGGCTAGTTTCGCGGCTTCGAAGGTCCACCGATGCCAGACGGCATGTGGTGGC 

GYPIKAPKLPGGYGLPYTTG 



GTAAACTGCCGTACGGCTACGGTCCGGGTGGCGTAGCAGGTGCTGCGGGTAAAGCAGGCT 
CATTTGACGGCATGCCGATGCCAGGCCCACCGCATCGTCCACGACGCCCATTTCGTCCGA 



661 ACCCAACCGGTACTGGTGTTGGTCCGCAGGCTGCTGCGGCAGCTGCGGCG AAGGCAGCAG 
TGGGTTGGCCATGACCACAACCAGGCGTCCGACGACGCCGTCGACGCCGCTTCCGTCGTC 



721 CAAAATTCGGCGCGGGTGCAGCGGGTGTTCTGCCGGGCGTAGGTGGTGCTGGCGTTCCGG 780 
GTTTTAAGCCGCGCCCACGTCGCCCACAAGACGGCCCGCATCCACCACGACCGCAAGGCC 

K • F- - G" A G" A' K G V h P G V G- G- • A G • V- P 



781 GTGTTCCAGGTGCGATCCCGGGCATCGGTGGTATCGCAGGCGTAGGTACTCCGGCGGCCG 840 
CACAAGGTCCACGCTAGGGCCCGTAGCCACCATAGCGTCCGCATCCATGAGGCCGCCGGC 

VPGAI P G IGGIAG VGTPAAA 



841 CTGCGGCTGCGGCAGCTGCGGCGAAAGCAGCTAAATACGGTGpGGCAGCAGGCCTGGTTC 900 
GACGCCGACGCCGTCGACGCCGCTTTCGTCGATTTATGCCACGCCGTCGTCCGGACCAAG 

AAAAAAAKAAKYGAAAGLVP 
FIG. 3(2) 
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90 1 CGGGTGGTCCAGGCTTCGGTCCGGGTGTTGTAGGCGTTCCGbGTGCTGGTGTTCCGGGCG 9 60 
GCCCACCAGGTCCGAAGCCAGGCCCACAACATCCGCAAGGC(?CACGACCACAAGGCCCGC 

GGPGFGPGVVGVPGAGVPGV 

961 TAGGTGTTCCAGGTGCGGGCATCCCGGTTGTACCGGGTGC AGGTATCCCGGGCGCTGCGG 1020 
ATCCACAAGGTCCACGCCCGTAGGGCCAACATGGCCCACGTCCATAGGGCCCGCGACGCC 

GVPGAGIPVVPGAGIPGAAV 

1021 TTCCAGGTGTTGTATCCCCGGAAGCGGC AGCTAAGGCTGCTGCGAAAGCTGCGAAATACG 1080 
AAGGTCCACAACATAGGGGCCTTCGCCGTCGATTCCGACGACGCTTTCGACGCTTTATGC 

P GVVSPEAAAKAAAKAAKYG 

1081 GAGCTCGTCCGGGCGTTGGTGTTGGTGGCATCCCGACCTACGGTGT AGGTGC AGGCGGTT 1140 
CTCGAGCAGGCCCGCAACCACAACCACCGTAGGGCTGGATGCCACATCCACGTCCGCCAA 

ARPGVGVGGIPTYGVGAGGF 

1141 TCCCAGGTTTCGGCGTTGGTGTTGGTGGCATCCCGGGTGTAGCTGGTGTTCCGTCTGTTG 1200 
AGGGTCCAAAGCCGCAACCACAACCACCGTAGGGCCCACATCGACCACAAGGCAGACAAC 

PGFGVGVGGIPGVAGVPSVG 

1201 GTGGCGTACCGGGTGTTGGTGGCGTTCCAGGTGTAGGTATCTCCCCGGAAGCGCAGGCAG 12 60 
CACCGCATGGCCCACAACCACCGCAAGGTCCACATCCATAGAGGGGCCTTCGCGTCCGTC 

GVPGVG-GVPGVGI SPEAQAA 

12 61 CTGCGGCAGCTAAAGCAGCGAAGTACGGCGTTGGTACTCCGGCGGCAGCAGCTGCTAAAG 1320 
GACGCCGTCGATTTCGTCGCTTCATGCCGCAACCATGAGGCCGCCGTCGTCGACGATTTC 

AAAKAAKYGVGTP A A A A A K A 

1321 CAGCGGCTAAAGCAGCGCAGTTCGGACTAGTTCCGGGCGTAGGTGTTGCGCC AGGTGTTG 1380 
GTCGCCGATTTCGTCGCGTCAAGCCTGATCAAGGCCCGCATCCACAACGCGGTCCACAAC 

AAKAAQFGLVPGVGVAPGVG 

FIG. 3(3) 



138 1 GCGTAGCACCGGGTGTTGGTGTTGCTCCGGGCGTAGGTCTGGCACCGGGTGTTGGCGTTG 14 40 
CGCATCGTGGCCCACAACCACAACGAGGCCCGCATCCAGACCGTGGCCCACAACCGCAAC 

V APG VGVAPGVGLAP GVGVA 



14 41 CACCAGGTGTAGGTGTTGCGCCGGGCGTTGGTGTAGCACCGGGTATCGGTCCGGGTGGCG 1500 
GTGGTCCACATCCACAACGCGGCCCGCAACCACATCGTGGCCCATAGCCAGGCCCACCGC 

PGVGVAP GVGVAPGIGP GGV 



1501 TTGCGGCTGCTGCGAAATCTGCTGCGAAGGTTGCTGCGAAAGCGCAGCTGCGTGCAGCAG 
AACGCCGACGACGCTTTAGACGACGCTTCCAACGACGCTTTCGCGTCGACGCACGTCGTC 

AAAAKSAAKVAAKAQLRAAA 



15 61 CTGGTCTGGGTGCGGGCATCCC AGGTCTGGGTGTAGGTGTTGGTGTTCCGGGCCTGGGTG 
GACCAGACCCACGCCCGTAGGGTCCAGACCCACATCCACAACCACAAGGCCCGGACCCAC 

GLGAGIPGLGVGVGVPGLGV 



1621 TAGGTGCAGGGQTACCGGGCCTGGGTGTTGGTGCAGGCGTTCCGGGTTTCGGTGCTGGCG 
ATCCACGTCCCCATGGCCCGGACCCACAACCACGTCCGCAAGGCCCAAAGCCACGACCGC 



1681 CGGACGAAGGTGTACGTCGTTCCCTGl'CTCCAGAACTGCGTGAAGGTGACCCGTCCTCTT 17 40 
GCCTGCTTCCACATGCAGCAAGGGACAGAGGTCTTGAbGCACTTCCACTGGGCAGGAGAA 

DEGVRRSLSPELREGDPSSS 



1741 CCCAGCACCTGCCGTCTACCCCGTCCTCTCCACGTGTTCCGGGCGCGCTGGCTGCTGCG A 1800 
GGGTCGTGGACGGCAGATGGGGCAGGAGAGGTGCACAAGGCCCGCGCGACCGACGACGCT 

QHLPSTPSSPRVPGALAAAK 



1801 AAGCGGCGAAATACGGTGCAGCGGTTCCGGGTGTACTGGGCGGTCTGGGTGCTCTGGGCG 
. TTCGCCGCTTTATGCCACGTCGCCAAGGCCCACATGACCCGCCAGACCCACGAGACCCGC 

AAKYGAAVPGVLGGLGALGG 

FIG. 3(4) 
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1861 GTGTTGGTATCCCGGGCGGTGTTGTAGGTGCAGGCCCAGCTGCAGCTGCTGC'TGCGGCAA 1920 
CACAACCATAGGGCCCGCCACAACATCCACGTCCGGGTCGACGTCGACGACGACGCCGTT 

VGIPGGVVGAGPAAAAAAAK 



192 1 AGGCAGCGGCGAAAGCAGCTCAGTTCGGTCTGGTTGGTGCAGCAGGTCTGGGCGGTCTGG 1980 
TCCGTCGCCGCTTTCGTCGAGTCAAGCCAGACCAACCACGTCGTCCAGACCCGCCAGACC 

AAAKAAQFGLVGAAGLGGLG 



1981 GTGTTGGCGGTCTGGGTGTACCGGGCGTTGGTGGTCTGGGTGGCATCCCGCCGGCGGCGG 204 0 
CACAACCGCCAGACCCACATGGCCCGCAACCACCAGACCCACCGTAGGGCGGCCGCCGCC 

VGGLGVP GVGGLGGIPPAAA 



2041 CAGCTAAAGCGGCTAAATACGGTGCAGCAGGTCTGGGTGGCGTTCTGGGTGGTGCTGGTC 2100 
GTCGATTTCGCCGATTTATGCCACGTCGTCCAGACCCACCGCAAGACCCACCACGACCAG 

AKAAKYGAAGLGGVLGGAGQ 



2101 AGTTCCCACTGGGCGGTGTAGCGGCACGTCCGGGTTTCGGTCTGTCCCCGATCTTCCCAG 2160 
XCAAGGG.TGACCCGCCACATCGCCGTGCAGGCCCAAAGCCAGACAGGGGCTAGAAGGGTC 

FPLGGVA ARP GFGLSP I FP G 



2161 GCGGTGCATGCCTGGGTAAAGCTTGCG'GCCGTAAACGTAAATAATGATAG 2210 
CGCCACGTACGGACCCATTTCGAACGCCGGCATTTGCATTTATTACTATCCTAG 
GACLGKACGRKRK * . * * 



FIG. 3(5) 
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FIG. 4(2) 



10/21 



FIG. 5(1) 
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FIG. 5(2) 



1 GATCCATGGGTGGCGTTCCGGGTGCTATCCCGGGTGGCGTTCCGGGTGGTGTATTCTACC 
AGCTGCTT A TAACTT 



6 1 CAGGCGCGGGTCTGGGTGCACTGGGCGGTGGTGCGCTGGGCCCGGGTGGTAAACCGCTGA 
GT CACTAAA GTAC TT 



121 AACCGGTTCCAGGCGGTCTGGCAGGTGCTGGTCTGGGTGCAGGTCTGGGCGCGTTCCCGG 
GA CAG. TG CTG GC C C 



181 CGGTTACCTTCCCGGGTGCTCTGGTTCCGGGTGGCGTTGCAGACGCAGCTGCTGCGTACA 
A T G GT AGT T ACT 



241 AAGCGGCAAAGGCAGGTGCGGGTCTGGGCGGGGTACCAGGTGTTGGCGGTCTGGGTGTAT 
TT TCTGTTTC A TCTAAG 



301 CTGCTGGCGCAGTTGTTCCGCAGCCGGGTGCAGGTGTAAAACCGGGCAAAGTTCCAGGTG 
ATGG T TACAGGTG GG 



3 61 TTGGTCTGCCGGGCGTATACCCGGGTGGTGTTCTGCCGGGCGCGCGTTTCCCAGGTGTTG 
GG AT A CGCAATG C G 



421 GTGTACTGCCGGGCGTTCCGACCGGTGCAGGTGTTAAACCGAAGGCACCAGGTGTAGGCG 
G> G C T A ■ C T A- A- G G- T T 



481 GCGCGTTCGCGGGTATCCCGGGTGTTGGCCCGTTCGGTGGTCCGCAGCCAGGCGTTCCGC 
ATTTA AA ACTGA ATACA 



541 TGGGTTACCCGATCAAAGCGCCGAAGCTTCCAGGTGGCTACGGTCTGCCGTACACCACCG 
GTC GCC GT T A C A 



FIG. 6(1) 



601 GTAAACTGCCGTACGGCTACGGTCCGGGTGGCGTAGCAGGTGCTGCGGGTAAAGCAGGCT 660 
G CT TGCAAGT A CGTT 



661 ACCCAACCGGTACTGGTGTTGGTCCGCAGGCTGCTGCGGC AGCTGCGGCGAAGGCAGCAG 720 
A G A G CC AAA GATA 



721 CAAAATTCGGCGCGGGTGCAGCGGGTGTTCTGCCGGGCGTAGGTGGTGCTGGCGTTCCGG 780 
G TTA CACCTTTAG T T 



7 81 GTGTTCCAGGTGCGATCCCGGGCATCGGTGGTATCGCAGGCGTAGGTACTCCGGCGGCCG 84 0 
CGTGATTATAC TG ATA 



841 CTGCGGCTGCGGCAGCTGCGGCGAAAGCAGCTAAATACGGTGCGGCAGCAGGCCTGGTTC 900 
A A ACTG CGTATT TAG 



901 CGGG7GGTCCAGGCTTCGGTCCGGGTGTTGTAGGCGTTCCGGGTGCTGGTGTTCCGGGCG 960 
T G TC AATTCAA C AT 



9 61 TAGGTGTTCCAGGTGCGGGCATCCCGGTTGTACCGGGTGCAGGTATCCCGGGCGCTGCGG 1020 
T C A TGTA CA TG AT 



102 1 TTCCAGGTGTTGTATCCCCGGAAGCGGCAGCTAAGGCTGCTGCGAAAGCTGCGAAATACG 108 
G G A A AT A AGAC 



1081 GAG.CTCSTCCGGGGGTTG^TGTTGg^GGCATCCCGACC^ACGGTGTa GG I G CAOTCGGTr'' 1140 
GCAGCACA A TTT GTATGC 



114 1 TCCCAGGTTTCGGCGTTGGTGTTGGTGGCATCCCGGGTGTAGCTGGTGTTCCGTCTGTTG 1200 
TCCTTCACAT TACA C TAG C 



1201 GTGGCGTACCGGGTGTTGGTGGCGTTCCAGGTGTAGGTATCTCCCCGGAAGCGCAGGCAG 12 60 
ATTCACATCGATCT C T 



FIG. 6(2) 
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1261 CTGCGGCAGCTAAAGCAGCGAAGTACGGCGTTGGTACTCCGGCGGCAGCAGCTGCTAAAG 1320 
ATCCGTC AGGCAAT 



1321 CAGCGGCTAAAGCAGCGCAGTTCGGACTAGTTCCGGGCGTAGGTGTTGCGCCAGGTGTTG 1380 
C. C CC TGT TTCCGTTA 



1381 GCGTAGCACCGGGTGTTGGTGTTGCTCCGGGCGTAGGTCTGGCACCGGGTGTTGGCGTTG 14 40 
GTT C G T A T CT TTA G 



14 41 CACCAGGTGTAGGTGTTGCGCCGGGCGTTGGTGTAGCACCGGGTATCGGTCCGGGTGGCG 1500 
TTAT GTT CGTCCTCT A 



1501 TTGCGGCTGCTGCGAAATCTGCTGCGAAGGTTGCTGCGAAAGCGCAGCTGCGTGCAGCAG 15 60 
A AA C C G C C CAT 



1561 CTGGTCTGGGTGCGGGCATCCCAGGTCTGGGTGTAGGTGTTGGTGTTCCGGGCCTGGGTG 1620 
G T T TATAT CCCTATA 



1 62 1 TAGGTGCAGGGGTACCGGGCCTGGGTGTTGGTGCAGGCGTTCCGGGTTTCGGTGCTGGCG 1680 
T TTT-TATA TT TC GAT 



1681 CGGACGAAGGTGTACGTCGTTCCCTGTCTCCAGAACTGCGTGAAGGTGACCCGTCCTCTT 17 4 C 
A T G A TA G GAG CTGCAG ATC C 



1741 CCCAGCACCTGCCGTCTACCCCGTCCTCTCCACGTGTTCCGGGCGCGCTGGCTGCTGCGA 1800 
T C CAGC CAACAGATAC CT 



1801 AAGCGGCGAAATACGGTGCAGCGGTTCCGGGTGTACTGGGCGGTCTGGGTGCTCTGGGCG 18 60 
AC TA AGTGCTAGCG CT 



1861 GTGTTGGTATCCCGGGCGGTGTTGTAGGTGCAGGCCCAGCTGCAGCTGCTGCTGCGGCAA 1320 
AAC A GGACACCCC CAC 



FIG. 6(3) 



1921 AGGCAGCGGCGAAAGCAGCTCAGTTCGGTCTGGTTGGTGCAGCAGGTCTGGGCGGTCTGG 1980 
ATTC CC TCAGACTGCAAC 



1981 GTGTTGGCGGTCTGGGTGTACCGGGCGTTGGTGGTCTGGGTGGCATCCCGCCGGCGGCGG 2040 
ACAGTATAT GCTATATATA 



2041 CAGCTAAAGCGGCTAAATACGGTGCAGCAGGTCTGGGTGGCGTTCTGGGTGGTGCTGGTC 2100 
C A TTCTATCAG CG 



2101 AGTTCCCACTGGGCGGTGTAGCGGC ACGTCCGGGTTTCGGTCTGTCCCCG ATCTTCCCAG 2160 
TAAGA AATC AT TCT 



2161 GCGGTGCATGCCTGGGTAAAGCTTGCGGCCGTAAACGTAAATAATGATAG 2210 
TGC G T GGAAG 



FIG. 6(4) 
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FIG. 7 
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FIG. 8 



18/21 



pND211 (4045 bp) 

TTCACTGGCC GTCGTTTTAC AACGTCGTGA CTGGGAAAAC CCTGGCGTTA 
CCCAACTTAA TCGCCTTGCA GCACATCCCC CTTTCGCCAG CTGGCGTAAT 
AGCGAAGAGG CCCGCACCGA TCGCCCTTCC CAACAGTTGC GCAGCCTGAA 
TGGCGAATGG CGCCTGATGC GGTATTTTCT CCTTACGCAT CTGTGCGGTA 
TTTCACACCG CATATGGTGC ACTCTCAGTA CAATCTGCTC TGATGCCGCA 
TAGTTAAGCC AGCCCCGACA CCCGCCAACA CCCGCTGACG CGCCCTGACG 
■GGCTTGTCTG CTCCCGGCAT CCGCTTACAG ACAAGCTGTG ACCGTCTCCG 
GGAGCTGCAT GTGTCAGAGG TTTTCACCGT CATCACCGAA ACGCGCGAGA 
CGAAAGGGCC TCGTGATACG CCTATTTTTA TAGGTTAATG TCATGATAAT 
AATGGTTTCT TAGACGTCAG GTGGCACTTT TCGGGGAAAT GTGCGCGGAA 
CCCCTATTTG TTTATTTTTC TAAATACATT CAAATATGTA TCCGCTCATG 
AGACAATAAC CCTGATAAAT GCTTCAATAA TATTGAAAAA GGAAGAGf AT, 
GAGTATTCAA CATTTCCGTG TCGCCCTTAT TCCCTTTTTT GCGGCATTTT 
GCCTTCCTGT TTTTGCTCAC CCAGAAACGC TGGTGAAAGT AAAAGATGCT 
GAAGATCAGT TGGGTGCACG AGTGGGTTAC ATCGAACTGG ATCTCAACAG 
CGGTAAGATC CTTGAGAGTT TTCGCCCCGA AGAACGTTTT CCAATGATGA 
GCACTTTTAA AGTTCTGCTA TGTGGCGCGG TATTATCCCG TATTGACGCC 
GGGCAAGAGC AACTCGGTCG CCGCATACAC TATTCTCAGA ATGACTTGGT 
TGAGTACTCA CCAGTCACAG AAAAGCATCT TACGGATGGC ATGACAGTAA 
GAGAATTATG CAGTGCTGCC ATAACCATGA GTGATAACAC TGCGGCCAAC 
TTACTTCTGA CAACGATCGG AGGACCGAAG GAGCTAACCG CTTTTTTGCA 
CAACATGGGG GATCATGTAA CTCGCCTTGA TCGTTGGGAA CCGGAGCTGA 
ATGAAGCCAT ACCAAACGAC GAGCGTGACA CCACGATGCC TGTAGCAATG 
GCAACAACGT TGCGCAAACT ATTAACTGGC GAACTACTTA CTCTAGCTTC 
CCGGCAACAA TTAATAGACT GGATGGAGGC GGATAAAGTT GCAGGACCAC 
TTCTGCGCTC GGCCCTTCCG GCTGGCTGGT TTATTGCTGA TAAATCTGGA 
GCCGGTGAGC GTGGGTCTCG CGGTATCATT GCAGCACTGG GGCCAGATGG 
TAAGCCCTCC CGTATCGTAG TTATCTACAC GACGGGGAGT CAGGCAACTA 
TGGATGAAC G AAATAGACAG ATCGCTGAGA TAGGTGCCTC ACTGATTAAG 
SATTGGTAAE TGTCAGACCA AGTTTACTCA TATATACTTT AGATTGATTT 
AAAACTTCAT TTTTAATTTA AAAGGATCTA GGTGAAGATC CTTTTTGATA 
ATCTCATGAC CAAAATCCCT TAACGTGAGT TTTCGTTCCA CTGAGCGTCA 
GACCCCGTAG AAAAGATCAA AGGATCTTCT TGAGATCCTT TTTTTCTGCG 
CGTAATCTGC TGCTTGCAAA CAAAAAAACC ACCGCTACCA GCGGTGGTTT 
GTTTGCCGGA TCAAGAGCTA CCAACTCTTT TTCCGAAGGT AACTGGCTTC 
AGCAGAGCGC AGATACCAAA TACTGTTCTT CTAGTGTAGC CGTAGTTAGG 
CCACCACTTC AAGAACTCTG TAGCACCGCC TACATACCTC GCTCTGCTAA 
TCCTGTTACC AGTGGCTGCT GCCAGTGGCG ATAAGTCGTG TCTTACCGGG 
TTGGACTCAA GACGATAGTT AC CGGATAAG GCGCAGCGGT CGGGCTGAAC 
GGGGGGTTCG TGCACACAGC CCAGCTTGGA GCGAACGACC TACACCGAAC 
TGAGATACCT ACAGCGTGAG CATTGAGAAA GCGCCACGCT TCCCGAAGGG 
AGAAAGGCGG ACAGGTATCC GGTAAGCGGC AGGGTCGGAA CAGGAGAGCG 
CACGAGGGAG CTTCCAGGGG GAAACGCCTG GTATCTTTAT AGTCCTGTCG 
GGTTTCGCCA CCTCTGA£TT GAGCGTCGAT TTTTGTGATG CTCGTCAGGG 



FIG. 9(1) 



GGGCGGAGCC 
GGCCTTTTGC 
ATTCTGTGGA 
CGCAGCCGAA 
GCGCCCAATA 
GCAGCTGGCA 
GCAATTAATG 
TATGCTTCCG 
ACACAGGAAA 
TGATTATCAG 
TTATCTTTCC 
CATAATTCAA 
CCTAATTCGA 
AGAACACCTT 
TAACTTTCCC 
TGTGGGTTTA 
CTTGAAGGTA 
GCTCAACAGC 
CTTGGCTTGG 
CCAGAATGCA 
CATCACCTTT 
ACATGAGAAA 
ACTAACCGCT 
ATTCTTCAAC 
GCATTTAATG 
CATCCCCATC 
TCTTTTTTTC 
TGTGTTAATG 



/GGGATAAATA 
ATAATGGTTG 
AAAGATTATG 
AOCTACCAAA 
ACAGATAACC 
TACCACTGGC 
GAAGGTGACG 
CAGAAGGCTT 
AGGTTTESGff' 



TATGGJ 
TGGCCTTTTG 
TAACCGTATT 
CGACCGAGCG 
CGCAAACCGC 
CGACAGGTTT 
TGAGTTAGCT 
GCTCGTATGT 
CAGCTATGAC 
CCAGCAGAGA 
CTTTATTTTT 
TCCATTTACT 
TGAAGATTCT 
GCCGATCAp C" 
CACAACGGAA 
GTGGTTGTAA 
AACTCATCAC 
CTGCTCAGGG 
AGCCTGTTGG 
GAATCACTGG 
GGTAAAGGTT 
AAACAGGGTA 
TCATACATCT 
GCTAACTTTG 
CATTGATGCC 
TTGTCTGCGA 
ATAAATTGCT 
GTTTCTTTTT 



TCTAACACCG 
CATGTACTAA 
CAATGCGCTT 
CAATGCCCCC 
ATCTGCGGTG 
GGTGATACTG 
CTCTTAAAAA 
TGGGGTGTGT 
TCCSTGGGTT- 

rtcot 
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lGCAAC 
CTCACATGTT 
ACCGCCTTTG 
CAGCGAGTCA 
CTCTCCCCGC 
CCCGACTGGA 
CACTCATTAG 
TGTGTGGAAT 
CATGATTACG 
TTAAGGAAAA 
GCTGCGGTAA 
ATGTTATGTT 
TGCTCAATTG 
CAAACGTCTC 
CAACTCTCAT 
AAACACCTGA 
CCCCAAGTCT 
TCAACGAGAA 
TGCGGTCATG 
CTTTTTTGGT 
CTAAGCTTAG 
CTCATACTCA 
CGTAGATTTC 
AGAATTTTTG 
ATTAAATAAA 
CAGATTCCTG 
TTAAGGCGAC 
TGTGCTCAIp r 



GCGGCCTTTT 
CTTTCCTGCG 
AGTGAGCTGA 
GTGAGCGAGG 
GCGTTGGCCG 
AAGCGGGCAG 
GCACCCCAGG 
TGTGAGCGGA 
CCAAGCTTGG 
CAGACAGGTT 
GTCGCATAAA 
CTGAGGGGAG 
TTATCAGCTA 
TTCAGGCCAC 
TGCATGGGAT 
CCGCTATCCC 
GGCTATGCAG 
TTAACATTCC 
GAATTACCTT 
TGTGCTTACC 
GTGAGAACAT 
CTTCTAAGTG 
TCTGGCGATT 
CAAGCAATGC 
GCACCAACGC 
GGATAAGCCA 
GTGCGTCCTC 



TACGGTTCCT 
TTATCCCCTG 
TACCGCTCGC 
AAGCGGAAGA 
ATTCATTAAT 
TGAGCGCAAC 
CTTTACACTT 
TAACAATTTC 
CTGCAGGTGA 
TATTGAGCGC 
AACCATTCTT 
TGAAAATTCC 
TGCGCCGACC 
TGACTAGCGA 
CATTGGGTAC 
TGATCAGTTT 
AAATCACCTG 
GTCAGGAAAG 
CAACCTCAAG 
CATCTCTCCG 



TGCGTGTTGA 
GGAGGTTGTA 
TGGGCAAACC 
CTGCAAAAAA 
ATAAATTATC 
AGCACATCAG 
TTAAGCCCTG 
GATACGAAAC 
TAftAeCTCCT' 



CGTTAAATCT 
CTATTTTACC 
TGGAACAACG 
AAGACAGCTA 
TAAATTCATA 
TCTGGCGGTG 
CAGGACGCAC 
AAGAAGGGC A 
GAAGCATTGG 
TA GGATGC CC 



CCCTGCCTGA 
ACGGCTGCAT 
GAAGGGCTAA 
GGCGTTATAA 
CTGACTGCCC 
AGTTCATTTT 
AAGCTGCTCT 



ATCACCGCAA 

TCTGGCGGTG 

CATAACCCTG 

AAGATCTCTC 

TAAAAAACAT 

TTGACATAAA 

TGACCACCAT 

GCATTCAAAG 

GATCCTAAGG K8S 

GGGAA^o*^!- 



FIG. 9(2) 
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FIG. 10 
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ED net pmol/gly 
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